HY2 FAMILY OF BILIN REDUCTASES

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims benefit of and priority to USSN / , , filed on 
May 29, 2001 and entitled HY2 FAMILY OF BILIN REDUCTASES, USSN 60/271,758 
filed on February 26, 2001, and to USSN 60/210,286, filed on June 8, 2000, all of which are 
incorporated herein by reference in their entirety for all purposes. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY
SPONSORED RESEARCH AND DEVELOPMENT

This invention was made, in part, with Government support under Grant 
Nos: 98-35304-6404 and AMD-9801768 awarded by the United States Department of 
Agriculture. The Government of the United States of America may have certain rights in 
this invention. 

FIELD OF THE INVENTION 

This invention relates to the field of phytochromes. In particular, this
invention relates to the discovery of a phytochromobilin synthase and a family of related
enzymes from photosynthetic prokaryotes that are capable of converting a biliverdin into
the phytochrome and phycobiliprotein chromophore precursors phytochromobilin,
phycocyanobilin and phycoerythrobilin.

BACKGROUND OF THE INVENTION 

The phytochromes comprise a family of biliprotein photoreceptors that
enable plants to adapt to their prevailing light environment (Kendrick and Kronenberg
(1994) Pp. 828 In: Photomorphogenesis in Plants, Dordrecht, The Netherlands:
Kluwer Academic Publishers). Phytochromes possess the ability to efficiently
photointerconvert between red light absorbing Pr and far red light absorbing Pfr forms, a
property conferred by covalent association of a linear tetrapyrrole (bilin or phytobilin) with
a large apoprotein. Phytochromes from cyanobacteria to green algae and higher plants
consist of a well conserved N-terminal polypeptide, roughly 390-600 amino acids in length
(see, e.g., U.S. Patent 6,046,014), to which the phytobilin prosthetic group, e.g.,
phytochromobilin (PΦB) or phycocyanobilin (PCB), is bound.

Phytobilins are linear tetrapyrrole molecules synthesized by plants, algae,
and cyanobacteria that function as the direct precursors of the chromophores of the
light-harvesting phycobiliproteins and of the photoreceptor phytochrome (Beale (1993) Chem.
Rev. 93: 785-802; Hughes and Lamparter (1999) Plant Physiol. 121: 1059-1068). The
pathways of phytobilin biosynthesis have been elucidated by biochemical fractionation of
plant and algal extracts, by overcoming a blocked step with exogenous putative
intermediates, and by analysis of linear tetrapyrrole-deficient mutants (Beale and Cornejo
(1991) J. Biol. Chem. 266: 22328-22332; Beale and Cornejo (1991) J. Biol. Chem. 266:
22333-22340; Beale and Cornejo (1991) J. Biol. Chem. 266: 22341-22345; Terry et al.
(1993) Arch. Biochem. Biophys. 306: 1-15). These studies indicate that the biosynthesis of
phytobilins shares common intermediates with heme and chlorophyll biosynthetic pathways
to the level of protoporphyrin IX, at which point the latter two pathways diverge by
metalation with iron or magnesium (Beale (1993) Chem. Rev. 93: 785-802). Phytobilins
are derived from heme, which is converted to biliverdin IXα (BV), the first committed
intermediate in their biosynthesis. In red algae, cyanobacteria, and plants, this
interconversion is accomplished by ferredoxin-dependent heme oxygenases that are related
in sequence to the mammalian heme oxygenase (Cornejo et al. (1998) Plant J. 15: 99-107;
Davis et al. (1999) Proc. Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et al. (1999)
Plant Cell 11: 335-347). Although they catalyze the same reaction, mammalian heme
oxygenases use an NADPH-dependent cytochrome P450 reductase to generate reducing
power for heme catabolism (Maines (1988) FASEB J. 2: 2557-2568).

The metabolic fate of BV differs in mammals, cyanobacteria, and plants,
with BV being metabolized by different reductases with unique double-bond specificities
(Figure 1). Mammalian biliverdin IXα reductase (BVR), an NAD(P)H-dependent enzyme
that catalyzes the two-electron reduction of BV at the C10 methine bridge to produce
bilirubin IXα (BR), was the first of these enzymes to be discovered (Maines and Trakshel
(1993) Arch. Biochem. Biophys. 300: 320-326). A similar enzyme, encoded by the gene
bvdR, was identified in cyanobacteria (Schluchter and Glazer (1997) J. Biol. Chem. 272:
13562-13569). Cyanobacteria and red algae also possess novel ferredoxin-dependent bilin
reductases for the synthesis of the linear tetrapyrrole precursors of their phycobiliprotein
light-harvesting antennae complexes (Beale and Cornejo (1991) J. Biol. Chem. 266:
22328-22332; Beale and Cornejo (1991) J. Biol. Chem. 266: 22333-22340; Beale and Cornejo
(1991) J. Biol. Chem. 266: 22341-22345; Cornejo et al. (1998) Plant J. 15: 99-107).
Primarily on the basis of studies with the red alga Cyanidium caldarium, these investigators
proposed that the biosynthesis of the two major phycobiliprotein chromophore precursors,
phycoerythrobilin (PEB) and phycocyanobilin (PCB), utilized two ferredoxin-dependent
bilin reductases and several double-bond isomerases. The first bilin reductase catalyzes the
two-electron reduction of BV at the C15 methine bridge to produce the BR isomer
15,16-dihydrobiliverdin (DHBV), whereas the second bilin reductase catalyzes the conversion of
15,16-DHBV to 3Z-PEB, a formal two-electron reduction of the C2 and C3 diene system.
In C. caldarium, an additional enzyme mediates the isomerization of 3Z-PEB to 3Z-PCB,
both of which appear to be isomerized to their corresponding 3E isomers before assembly
with the nascent phycobiliprotein apoproteins (Beale and Cornejo (1991) J. Biol. Chem.
266: 22328-22332; Beale and Cornejo (1991) J. Biol. Chem. 266: 22333-22340; Beale and
Cornejo (1991) J. Biol. Chem. 266: 22341-22345).

More recent studies lend support for a similar pathway of PCB and PEB
synthesis in cyanobacteria (Cornejo and Beale (1997) Photosynth. Res. 51: 223-230). In
contrast with mammals and phycobiliprotein-containing organisms, plants and green algae
reduce BV to 3Z-PΦB by the ferredoxin-dependent enzyme PΦB synthase, which targets
the 2,3,3¹,3²-diene system for reduction (Terry et al. (1995) J. Biol. Chem. 270:
11111-11118; Wu et al. (1997) J. Biol. Chem. 272: 25700-25705). In plants, 3Z-PΦB is
isomerized to its 3E isomer, which appears to be the immediate precursor of the
phytochrome chromophore (Ibid). The green alga Mesotaenium caldariorum possesses a
second bilin reductase activity that catalyzes the reduction of the 18-vinyl group of PΦB to
produce 3Z-PCB (Wu et al. (1997) J. Biol. Chem. 272: 25700-25705). These investigations
also revealed that 3E-PCB is the natural phytochrome chromophore precursor in this
organism.

Despite the extensive biochemical analysis of the phytobilin biosynthetic
pathways in plants, algae, and cyanobacteria, the low levels of bilin reductase expression
have previously hindered efforts to clone these enzymes.

SUMMARY OF THE INVENTION 

This invention pertains to the isolation and characterization of a family of
bilin reductases (designated herein as the HY2 family). These bilin reductases catalyze the
conversion of a biliverdin to a phytobilin and can form component(s) of a phytochrome
biosynthetic pathway. The bilin reductases of this invention can be used in vivo or in vitro
to simply convert biliverdins to phytobilins or, in conjunction with other enzymes in the
phytochrome synthetic pathway, to synthesize complete phytochromes and/or phytofluors.

In one embodiment, this invention provides an isolated HY2 family bilin
reductase comprising an amino acid consensus sequence as illustrated in Figure 5 or in
Figure 10 and having bilin reductase activity. In certain embodiments, the bilin reductase
comprises at least 50% sequence conservation, preferably at least 70% sequence
conservation, most preferably at least 90% sequence conservation as shown in Figure 10
and/or at least 80% sequence conservation, more preferably at least 100% sequence
conservation as shown in Figure 5. In certain preferred embodiments, the bilin reductase is
PebA and/or PebB.

In another embodiment, this invention provides a ferredoxin-dependent bilin
reductase comprising at least 15%, preferably at least 20%, more preferably at least 30%,
and most preferably at least 50%, at least 75%, at least 90% or at least 95% sequence
identity with an enzyme selected from the group consisting of HY2_ARATH,
YCP2_SYNPY, YHP2_PROMA, YHP3_PROMA, YCP3_SYNPY, SLR0116,
PcyA_ANASP, PcyA_NOSPU, PcyA_SYNY3, PcyA_SYN81, PcyA_PROME,
PebA_SYNPY, PebA_SYN81, PebA_PROMA, PebA_PROME, PebB_NOSPU,
HY2_ARATH, RCCR_ARATH, and RCCR_HORVU, and where, when aligned with HY2,
comprises conserved hydrophobic residues at positions 137, 157, 158, 256, and 314. In
preferred embodiments, the bilin reductase, when aligned with HY2, comprises a residue
selected from the group consisting of Pro-151, Phe-221, Ser-222, and Asp-171 and more
preferably, when aligned with HY2, comprises Pro-151, Phe-221, Ser-222, and Asp-171.
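
As an illustration only, a percent sequence identity of the kind recited above can be computed from a pairwise alignment roughly as follows. This is a minimal sketch and not the method used in this application: the function names `percent_identity` and `meets_threshold`, the gap character, and the choice to count only columns in which neither sequence has a gap are all assumptions, and the two toy aligned sequences are invented for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumption: identity is counted over columns where neither row has a gap.
    Other conventions (e.g. dividing by the shorter sequence length) exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the pair reaches the given percent identity (e.g. 15, 50, 95)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical toy alignment (not a real HY2 family sequence).
row_a = "MKT-LLPF"
row_b = "MRTALL-F"
print(round(percent_identity(row_a, row_b), 1))
```

Because the result depends on the alignment and on the denominator convention, any threshold comparison should state which convention was used.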

In certain embodiments, the HY2 bilin reductases (HY2 family members) of
this invention exclude (proviso out) one or more of the following: hvrccr, atrccr,
rccr_horvu, rccr_arath, ycp2_synpy, ycp3_synpy, and HY2.

In still another embodiment, this invention provides an isolated bilin
reductase having bilin reductase activity and comprising an amino acid sequence of a
polypeptide selected from the group consisting of HY2, athy2, slr0116, c362_anab,
ycp2_synpy, ycp3_synpy, PcyA_ANASP, PcyA_NOSPU, PcyA_SYNY3, PcyA_SYN81,
PcyA_PROME, PebA_SYNPY, PebA_SYN81, PebA_PROMA, PebA_PROME,
PebA_NOSPU, PebB_SYNPY, PebB_SYN81, PebB_PROMA, PebB_PROME,
PebB_NOSPU, HY2_ARATH, RCCR_ARATH, and RCCR_HORVU, or conservative
substitutions thereof. In preferred embodiments, the bilin reductase comprises an amino
acid sequence of a polypeptide selected from the group consisting of athy2, slr0116,
c362_anab, ycp2_synpy, ycp3_synpy, PcyA_ANASP, PcyA_NOSPU, PcyA_SYNY3,
PcyA_SYN81, PcyA_PROME, PebA_SYNPY, PebA_SYN81, PebA_PROMA,
PebA_PROME, PebA_NOSPU, PebB_SYNPY, PebB_SYN81, PebB_PROMA,
PebB_PROME, PebB_NOSPU, HY2_ARATH, RCCR_ARATH, and RCCR_HORVU.

This invention also provides methods of converting a biliverdin to a
phytobilin. The methods involve contacting a bilin reductase of this invention (e.g., an HY2
family bilin reductase) with a biliverdin whereby the biliverdin is converted to a phytobilin.
In certain embodiments, the bilin reductase is a cyanobacterial bilin reductase, and/or an
algal bilin reductase, and/or a plant bilin reductase. The bilin reductase can be a
recombinantly expressed bilin reductase. The contacting can be in vivo or ex vivo. In
certain embodiments, the contacting is in a cell and the bilin reductase is a heterologous
polypeptide. The methods can further comprise contacting the phytochromobilin with a
second bilin reductase to produce a phytochrome. In certain embodiments, the methods
further comprise contacting the phytochromobilin with a second bilin reductase (e.g., PebB)
to produce a phytofluor. In certain embodiments, the bilin reductase is ycp2_synpy and/or
ycp3_synpy.

This invention also provides isolated nucleic acids encoding a bilin reductase 
as described herein (e.g. an HY2 family member). Preferred nucleic acids comprise a 
vector. 

Also provided are cells comprising a heterologous nucleic acid comprising a
nucleic acid encoding a bilin reductase (e.g., an HY2 family member) as described herein.
Preferred cells include, but are not limited to, algal cells, plant cells, yeast cells, bacterial
cells, insect cells, and mammalian cells.

In still another embodiment, this invention provides a nucleic acid that
specifically hybridizes with a nucleic acid encoding any of the bilin reductases described
herein under stringent conditions and that encodes a polypeptide having bilin reductase
activity. In certain embodiments, the nucleic acids exclude (proviso out) nucleic acids
encoding one or more of the following: hvrccr, atrccr, rccr_horvu, rccr_arath, ycp2_synpy,
ycp3_synpy, and HY2. Preferred nucleic acids are vectors (e.g., plasmids, cosmids, etc.).

In still another embodiment, this invention provides a method of detecting
expression of a polypeptide. The method comprises providing a cell comprising a nucleic
acid encoding an apophytochrome; and a nucleic acid encoding a bilin reductase that
produces a phytobilin that assembles with said apophytochrome to produce a
holophytochrome or a phytofluor; and detecting an optical signal produced by the
holophytochrome or phytofluor.

This invention also provides a method of producing a photoactive
holophytochrome. The method involves co-expressing in a cell (e.g., an algal cell, a yeast
cell, a bacterial cell, a plant cell, an insect cell, a mammalian cell, etc.): a heme oxygenase;
an apophytochrome; and a ferredoxin-dependent bilin reductase; whereby the cell produces
the photoactive holophytochrome and where one or more of the apophytochrome and the
ferredoxin-dependent bilin reductase are expressed by heterologous nucleic acids. In
preferred embodiments, the ferredoxin-dependent bilin reductase is an HY2 family bilin
reductase (e.g., HY2, pcyA, etc.). In a preferred embodiment, the apophytochrome and the
ferredoxin-dependent bilin reductase are both expressed by heterologous nucleic acids. In
certain embodiments, the heme oxygenase is expressed by a heterologous nucleic acid. In
certain particularly preferred embodiments, the photoactive holophytochrome is not a
phytofluor, while in other preferred embodiments, the photoactive holophytochrome is a
phytofluor. The apophytochrome can be expressed as a fusion protein with a protein that is
to be labeled with the phytofluor or holophytochrome. In certain preferred embodiments,
the method comprises expressing the ferredoxin-dependent bilin reductase pebA and/or
pebB. In a particularly preferred embodiment, the cell is a bacterial cell (e.g., E. coli). The
method can further involve recovering the photoactive holophytochrome or phytofluor from
the cell.

In another embodiment, this invention provides a cell (e.g., an algal cell, a
yeast cell, a bacterial cell, a plant cell, an insect cell, or a mammalian cell) comprising: a
heme oxygenase; an apophytochrome; and a ferredoxin-dependent bilin reductase; whereby
the cell produces a photoactive holophytochrome and where one or more of the
apophytochrome and the ferredoxin-dependent bilin reductase are expressed by
heterologous nucleic acids. The ferredoxin-dependent bilin reductase is preferably an HY2
family bilin reductase (e.g., HY2, pcyA, etc.). In certain embodiments, the apophytochrome
and the ferredoxin-dependent bilin reductase are both expressed by heterologous nucleic
acids. They can both be expressed by the same heterologous nucleic acid. In certain cells,
the heme oxygenase is an endogenous heme oxygenase. In other cells, the heme oxygenase
is expressed by a heterologous nucleic acid. The expressed holophytochrome is, in certain
embodiments, not a phytofluor and, in other embodiments, is a phytofluor. Certain
preferred cells express pebA and/or pebB. One preferred cell is a bacterial cell (e.g., E.
coli).

This invention also provides an isolated nucleic acid comprising: a nucleic
acid encoding a heme oxidoreductase; and a nucleic acid encoding a ferredoxin-dependent
bilin reductase; where the nucleic acid expresses a functional heme
oxidoreductase and a functional bilin reductase. The heme oxidoreductase and the bilin
reductase can be under control of the same, or different, promoters. The promoter can
include a constitutive promoter, an inducible promoter, or a tissue-specific promoter. The
nucleic acid can be present in a cell (e.g., a bacterial cell, a plant cell, a yeast cell, a
mammalian cell, an insect cell, etc.). Preferred nucleic acids include one or more genes
selected from the group consisting of HO1, HY2, PcyA, PebA, and PebB. One preferred
nucleic acid comprises an HO1 coding region and/or a pcyA coding region and/or a pdyB.

Definitions.

The term "fluorescent adduct" refers to a fluorescent molecule (i.e., one
capable of absorbing light of one wavelength and emitting light of a second wavelength)
comprising an "apoprotein" (also referred to as an apophytochrome) component joined to a
"bilin" component, both of which are described below. The fluorescent phytochrome-bilin
conjugates (e.g., phytochrome-PEB adducts) are also referred to herein as "phytofluors".
The manner in which the two components are joined to form an adduct is irrelevant to the
present invention. Typically, the two components spontaneously form an adduct through
covalent interactions. The components may also be deliberately linked through covalent
bonds (e.g., through the use of crosslinking reagents). The fluorescent adducts of this
invention do not require pairing of an apoprotein with its corresponding native bilin. To
the contrary, the invention contemplates adducts consisting of naturally occurring or
engineered apoproteins with bilins derived from different organisms, or with non-naturally
occurring synthetic linear oligopyrroles or oligopyrrole mimetics.

The terms "apoprotein", "apophytochrome", or "apoprotein polypeptide", as
used herein, refer to polypeptides derived from eukaryotes, such as vascular plants,
non-vascular plants, and algae, or from prokaryotes, such as cyanobacteria and prochlorophytes.
The term encompasses both naturally occurring apoproteins and variant polypeptides, e.g.,
derived through mutagenesis. The apoproteins have a hydrophobic pocket, referred to as
the chromophore binding site, capable of forming an adduct with a bilin component. The
apoproteins of the invention are typically homodimeric proteins about 1100 amino acids in
length, each subunit being composed of two major domains. The globular 70 kD
N-terminal domain contains the hydrophobic pocket, while the more elongated 55 kD carboxyl
terminal domain contains the sites at which the two subunits are associated. Preferred
analogues are recognized by and thus comprise the consensus sequence of Figure 6 in U.S.
Patent 6,046,014. The apoprotein can be derived from vascular and non-vascular plants,
green algae, or bacteria, can be recombinantly expressed, or can be chemically synthesized
de novo. Preferred apoproteins are encoded by plant genes, algal genes, bacterial genes, or
cyanobacterial genes. Particularly preferred apoproteins include any of the apoproteins
described herein or in U.S. Patent 6,046,014 or those listed in the sequence listing of U.S.
Patent 6,046,014 or conservative substitutions of these sequences. Most preferred
apoproteins include apoproteins from plants (e.g., oats with an apoprotein having about
1100 amino acid residues), green algae (e.g., Mesotaenium caldariorum), or cyanobacteria
(as illustrated in U.S. Patent 6,046,014), or related proteins having conservative
substitutions. Truncated apoproteins consisting of a chromophore domain (the apoprotein
N-terminal subsequence sufficient for lyase activity) are particularly preferred. One
preferred N-terminal subsequence consists of less than about 600 N-terminal amino acids,
more preferably less than about 515 N-terminal amino acids, and most preferably less than
about 400 N-terminal amino acids. Apophytochromes can be readily identified by one of
skill in the art by comparison of the polypeptide sequence in question with the
apophytochrome consensus sequence provided in Figure 6 of U.S. Patent 6,046,014 using
standard sequence comparison methodologies. For a general discussion of apoprotein
structure and function, see Quail et al. (1997) Plant Cell and Environment 20: 657-665.

The "bilin" components of the adducts of the invention are linear
polypyrroles (e.g., di-, tri-, or tetrapyrroles) capable of fluorescing, or photointerconverting
between spectrophotometrically distinct forms, when associated with an apoprotein.
Typically, the bilin components of the invention are isolated from vascular plants, algae, or
cyanobacteria according to standard techniques. The bilin components can also be
synthesized de novo. For a general discussion of bilins useful in the present invention, see
Falk (1989) Pp. 355-399 In: The Chemistry of Linear Oligopyrroles and Bile Pigments,
Springer-Verlag, Vienna.

The term "chromophore domain" or "minimal chromophore domain" refers
to the apoprotein N-terminal subsequence sufficient for lyase activity, i.e., the ability to
spontaneously assemble in the presence of a bilin to form a phytofluor. Chromophore
domains typically comprise less than 600 amino acids of the N terminus of the apoprotein,
preferably less than about 515 amino acids, more preferably less than about 450 amino
acids and most preferably less than about 400, 390, 350 or even as few as 197 N-terminal
amino acids known as the "bilin lyase domain" (see Wu and Lagarias (2000) Biochemistry
39: 13487-13495). One preferred chromophore domain comprises the 514 N-terminal
amino acids of a cyanobacterial phytochrome.

The terms "polypeptide", "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid
polymers in which one or more amino acid residue is an artificial chemical analogue of a
corresponding naturally occurring amino acid, as well as to naturally occurring amino acid
polymers. The term also includes variants on the traditional peptide linkage joining the
amino acids making up the polypeptide.

The terms "nucleic acid" or "oligonucleotide" or grammatical equivalents
herein refer to at least two nucleotides covalently linked together. A nucleic acid of the
present invention is preferably single-stranded or double-stranded and will generally contain
phosphodiester bonds, although in some cases, as outlined below, nucleic acid analogs are
included that may have alternate backbones, comprising, for example, phosphoramide
(Beaucage et al. (1993) Tetrahedron 49(10): 1925 and references therein; Letsinger (1970)
J. Org. Chem. 35: 3800; Sprinzl et al. (1977) Eur. J. Biochem. 81: 579; Letsinger et al.
(1986) Nucl. Acids Res. 14: 3487; Sawai et al. (1984) Chem. Lett. 805; Letsinger et al.
(1988) J. Am. Chem. Soc. 110: 4470; and Pauwels et al. (1986) Chemica Scripta 26: 1419),
phosphorothioate (Mag et al. (1991) Nucleic Acids Res. 19: 1437; and U.S. Patent No.
5,644,048), phosphorodithioate (Briu et al. (1989) J. Am. Chem. Soc. 111: 2321),
O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A
Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm (1992) J. Am. Chem. Soc. 114: 1895; Meier et al. (1992) Chem. Int.
Ed. Engl. 31: 1008; Nielsen (1993) Nature 365: 566; Carlsson et al. (1996) Nature 380:
207). Other analog nucleic acids include those with positive backbones (Dempcy et al.
(1995) Proc. Natl. Acad. Sci. USA 92: 6097), non-ionic backbones (U.S. Patent Nos.
5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; Angew. (1991) Chem. Intl. Ed.
English 30: 423; Letsinger et al. (1988) J. Am. Chem. Soc. 110: 4470; Letsinger et al. (1994)
Nucleoside & Nucleotide 13: 1597; Chapters 2 and 3, ACS Symposium Series 580,
"Carbohydrate Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook;
Mesmaeker et al. (1994) Bioorganic & Medicinal Chem. Lett. 4: 395; Jeffs et al. (1994) J.
Biomolecular NMR 34: 17; Tetrahedron Lett. 37: 743 (1996)) and non-ribose backbones,
including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and
7, ACS Symposium Series 580, Carbohydrate Modifications in Antisense Research, Ed.
Y.S. Sanghvi and P. Dan Cook. Nucleic acids containing one or more carbocyclic sugars
are also included within the definition of nucleic acids (see Jenkins et al. (1995) Chem.
Soc. Rev. pp. 169-176). Several nucleic acid analogs are described in Rawls, C & E News
June 2, 1997 page 35. These modifications of the ribose-phosphate backbone may be done
to facilitate the addition of additional moieties such as labels, or to increase the stability and
half-life of such molecules in physiological environments.

The term "heterologous" as it relates to nucleic acid sequences such as
coding sequences and control sequences, denotes sequences that are not normally associated
with a region of a recombinant construct, and/or are not normally associated with a
particular cell. Thus, a "heterologous" region of a nucleic acid construct is an identifiable
segment of nucleic acid within or attached to another nucleic acid molecule that is not found
in association with the other molecule in nature. For example, a heterologous region of a
construct could include a coding sequence flanked by sequences not found in association
with the coding sequence in nature. Another example of a heterologous coding sequence is
a construct where the coding sequence itself is not found in nature (e.g., synthetic sequences
having codons different from the native gene). Similarly, a host cell transformed with a
construct which is not normally present in the host cell would be considered heterologous
for purposes of this invention.

The terms "isolated", "purified", or "biologically pure" refer to material which
is substantially or essentially free from components which normally accompany it as found
in its native state. With respect to nucleic acids and/or polypeptides, the term can refer to
nucleic acids or polypeptides that are no longer flanked by the sequences typically flanking
them in nature.

The term "recombinant" or "recombinantly expressed" when used with 
reference to a cell indicates that the cell replicates or expresses a nucleic acid, or expresses a 
peptide or protein encoded by a nucleic acid whose origin is exogenous to the cell. 
Recombinant cells can express genes that are not found within the native (non-recombinant) 
form of the cell. Recombinant cells can also express genes found in the native form of the 
cell wherein the genes are re-introduced into the cell by artificial means, for example under 
the control of a heterologous promoter. 

An "HY2-related gene" or a "member of the HY2 family" refers to a gene
that encodes a ferredoxin-dependent bilin reductase and that can catalyze a two- or
four-electron reduction of a linear tetrapyrrole to the biologically active precursors of the
chromophores of phytochromes and phycobiliproteins. Typically 200-300 amino acids in
length, these enzymes can be recognized by the characteristic signature sequence depicted
in Figures 5 and 10.

The terms "stringent conditions" or "hybridization under stringent
conditions" refer to conditions under which a probe will hybridize preferentially to its
target subsequence, and to a lesser extent to, or not at all to, other sequences. "Stringent
hybridization" and "stringent hybridization wash conditions" in the context of nucleic acid
hybridization experiments such as Southern and northern hybridizations are sequence
dependent, and are different under different environmental parameters. An extensive guide
to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in
Biochemistry and Molecular Biology: Hybridization with Nucleic Acid Probes, part I,
chapter 2, Overview of principles of hybridization and the strategy of nucleic acid probe
assays, Elsevier, New York. Generally, highly stringent hybridization and wash conditions
are selected to be about 5°C lower than the thermal melting point (Tm) for the specific
sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic
strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched
probe. Very stringent conditions are selected to be equal to the Tm for a particular probe.

An example of stringent hybridization conditions for hybridization of
complementary nucleic acids which have more than 100 complementary residues on a filter
in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42°C, with the
hybridization being carried out overnight. An example of highly stringent wash conditions
is 0.15 M NaCl at 72°C for about 15 minutes. An example of stringent wash conditions is a
0.2x SSC wash at 65°C for 15 minutes (see Sambrook et al. (1989) Molecular Cloning - A
Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor
Press, NY, for a description of SSC buffer). Often, a high stringency wash is preceded by a
low stringency wash to remove background probe signal. An example medium stringency
wash for a duplex of, e.g., more than 100 nucleotides, is 1x SSC at 45°C for 15 minutes.
An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x
SSC at 40°C for 15 minutes. In general, a signal to noise ratio of 2x (or higher) than that
observed for an unrelated probe in the particular hybridization assay indicates detection of a
specific hybridization. Nucleic acids which do not hybridize to each other under stringent
conditions are still substantially identical if the polypeptides which they encode are
substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the
maximum codon degeneracy permitted by the genetic code.
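
The example conditions and rules of thumb in the two preceding paragraphs can be collected into a small data structure, sketched below for illustration. The names `WashCondition`, `EXAMPLE_WASHES`, `highly_stringent_temperature`, and `is_specific_hybridization` are hypothetical and appear nowhere in this application; the numeric values are the examples recited in the text (0.2x SSC for the stringent wash), and the Tm passed in is assumed to have been determined separately for the probe in question.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WashCondition:
    label: str
    buffer: str
    temperature_c: float
    minutes: int


# Example wash conditions as recited in the text (illustrative, not exhaustive).
EXAMPLE_WASHES = (
    WashCondition("highly stringent", "0.15 M NaCl", 72.0, 15),
    WashCondition("stringent", "0.2x SSC", 65.0, 15),
    WashCondition("medium stringency", "1x SSC", 45.0, 15),
    WashCondition("low stringency", "4-6x SSC", 40.0, 15),
)


def highly_stringent_temperature(tm_c: float, offset_c: float = 5.0) -> float:
    """Temperature about 5 degrees C below the probe's Tm (Tm supplied by caller)."""
    return tm_c - offset_c


def is_specific_hybridization(signal: float,
                              unrelated_probe_signal: float,
                              min_ratio: float = 2.0) -> bool:
    """Apply the 2x (or higher) signal-to-noise rule of thumb from the text."""
    if unrelated_probe_signal <= 0:
        raise ValueError("unrelated probe signal must be positive")
    return signal / unrelated_probe_signal >= min_ratio
```

For instance, under these assumptions a probe with a Tm of 77°C would be washed at about 72°C under highly stringent conditions, and a signal of 10 units against an unrelated-probe signal of 4 units would count as specific.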

The term conservative substitution is used herein to refer to replacement of
amino acids in a protein with different amino acids that do not substantially change the
functional properties of the protein. Thus, for example, a polar amino acid might be
substituted for a polar amino acid, a non-polar amino acid for a non-polar amino acid, and
so forth. The following six groups each contain amino acids that are conservative
substitutions for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) Aspartic acid
(D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5)
Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine
(Y), and Tryptophan (W).
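
The six groups above lend themselves to a simple lookup, sketched here for illustration. The names `CONSERVATIVE_GROUPS` and `is_conservative_substitution` are hypothetical, and treating an unchanged residue as "not a substitution" is an assumption of this sketch rather than something stated in the definition.

```python
# The six conservative-substitution groups listed in the definition above,
# written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = (
    frozenset("AST"),   # Alanine, Serine, Threonine
    frozenset("DE"),    # Aspartic acid, Glutamic acid
    frozenset("NQ"),    # Asparagine, Glutamine
    frozenset("RK"),    # Arginine, Lysine
    frozenset("ILMV"),  # Isoleucine, Leucine, Methionine, Valine
    frozenset("FYW"),   # Phenylalanine, Tyrosine, Tryptophan
)


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """True if the two residues differ and fall in the same listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # assumption: an unchanged residue is not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative_substitution("D", "E"))
print(is_conservative_substitution("A", "W"))
```

Residues that appear in none of the six groups (for example glycine, proline, cysteine, and histidine) are never reported as conservative by this sketch, which simply mirrors the list as written.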

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a pathway for the biosynthesis of bilin pigments. The
mammalian bile pigment bilirubin and the linear tetrapyrrole precursors of the phytochrome
and phycobiliprotein chromophores of plants, algae, and cyanobacteria share the common
intermediate biliverdin IXα. HY2, phytochromobilin synthase or
3Z-phytochromobilin:ferredoxin oxidoreductase; PcyA, 3Z-phycocyanobilin:ferredoxin
oxidoreductase; PebA, 15,16-dihydrobiliverdin:ferredoxin oxidoreductase; PebB,
3Z-phycoerythrobilin:ferredoxin oxidoreductase; BVR/BvdR, biliverdin IXα:NAD(P)H
oxidoreductase. The dashed arrow with a question mark indicates a second type of putative
3Z-phycocyanobilin:ferredoxin oxidoreductase. The dashed arrow indicates a putative
phycoerythrobilin-phycocyanobilin isomerase (Beale and Cornejo (1991) J. Biol. Chem.
266: 22333-22340).

Figure 2 illustrates phytochrome biosynthesis in Arabidopsis. 

Figures 3A and 3B illustrate the HY2 locus of Arabidopsis. Figure 3A 
shows a map of the region of chromosome 3 containing HY2. Two distinct mapping 
populations were screened, and mapping results with molecular markers are summarized 
schematically, indicating that HY2 lies in a region 66 kb in length. Markers starting with the 
letter c are CAPS markers developed during this study. DNA sequence information for 
bacterial artificial chromosomes (BACs) MZB10 and F3L24 is available in 
GenBank/EMBL/DDBJ. The HY2 gene structure with mutations is illustrated at the bottom. 
Exons are depicted as dark boxes and thick lines, which reflect coding regions and 5'/3' 
untranslated regions, respectively. Dotted lines indicate introns. Figure 3B shows the 
genomic sequence of HY2 and the deduced HY2 protein sequence from the Columbia (Col) 
ecotype. Uppercase letters represent exons determined by sequence analysis of HY2 
cDNAs. Introns and spacer sequences are indicated with lowercase letters. The stop codon 
is double underlined. Mutations in hy2 alleles are shown in boldface letters. Single 
nucleotide polymorphisms in both Ler and Wassilewskija (Ws) ecotypes include the 
following: inserted T (at nucleotide 234), G364T conversion with amino acid change to 
Asn, and G1182A conversion (silent). Single nucleotide polymorphisms in the Ler ecotype 
only include the following: C515A (in intron), G884A (silent), C1145T (in intron), and 
G1717A (in intron). The single nucleotide polymorphism in Ws ecotype only is C1910T 
(silent). 

Figures 4A and 4B show an RNA gel blot hybridization of 
MZB10.18/F3L24.1. Figure 4A: Total RNA (10 μg) from 1-week-old seedlings was 
analyzed by RNA gel blotting using the MZB10.18/F3L24.1 cDNA as a probe. RNA was 
prepared from seedlings of the hy2 mutants and corresponding wild-type plants. Figure 4B: 
The same RNA gel blot was probed with rDNA as a loading control. 

Figure 5 shows an alignment of HY2 and HY2-related proteins. The HY2 
protein is aligned with proteins of unknown function from oxygenic photosynthetic 
bacteria identified by PSI-BLAST. Conserved residues in 100 or 80% of the aligned 
sequences are depicted in the consensus sequence with uppercase or lowercase letters, 
respectively. Sequence similarity groups shown in the consensus sequence reflect 
conservation in 100% of the sequences. These are labeled as follows: 1, D = N; 4, R = K; 5, 
F = Y = W; and 6, L = I = V = M. Dark shading with white letters, gray shading with white 
letters, and gray shading with black letters reflect 100, 80, and 60% sequence conservation, 
respectively. Sequence identifiers correspond to hypothetical proteins from Synechococcus 
sp WH8020 (YCP2_SYNPY and YCP3_SYNPY), from Prochlorococcus (YHP2_PROMA 
and YHP3_PROMA), and from Synechocystis sp PCC 6803 gene (cyanobase locus slr0116; 
see http://www.kazusa.or.jp/cyano/cyano.html). Database accession numbers are 
AB045112 for HY2 (DDBJ), Q02189 for YCP2_SYNPY (SWISSPROT), Q02190 for 
YCP3_SYNPY (SWISSPROT), CAB95700.1 for YHP2_PROMA (EMBL), CAB95701.1 
for YHP3_PROMA (EMBL), and S76709 for slr0116 (Protein Information Resource). 
Asterisks are shown every 20 residues. 
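
The uppercase/lowercase consensus convention described for this figure can be sketched as follows. This is an illustrative reimplementation under stated assumptions, not the program used to prepare the figure: the function name, the use of "." for positions below the lower threshold, and the toy alignment are all hypothetical, and similarity groups are not modeled.

```python
from collections import Counter


def consensus(alignment, upper=1.0, lower=0.8):
    """Build a consensus string from equal-length aligned sequences.

    A residue conserved in at least `upper` of the sequences is written in
    uppercase, one conserved in at least `lower` in lowercase; any other
    column (or a column dominated by gaps) is written as ".".
    """
    n = len(alignment)
    out = []
    for column in zip(*alignment):
        residue, count = Counter(column).most_common(1)[0]
        fraction = count / n
        if residue == "-":
            out.append(".")
        elif fraction >= upper:
            out.append(residue.upper())
        elif fraction >= lower:
            out.append(residue.lower())
        else:
            out.append(".")
    return "".join(out)


# Invented five-sequence alignment, for illustration only.
toy_alignment = ["MASLF", "MASIF", "MATLF", "MASLY", "MGSIF"]
toy_consensus = consensus(toy_alignment)
```

With the toy alignment, the first column is fully conserved (uppercase), three columns are conserved in four of five sequences (lowercase), and one column falls below 80%.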

Figures 6A through 6D show transient expression of GFP fusions in onion 
cells and tobacco cells. Figures 6A to 6C: Cells expressing GFP (onion) (Figure 6A), the 
HY2 chloroplast transit peptide fused to GFP (HY2TP-GFP) (onion) (Figure 6B), and 
HY2TP-GFP (tobacco) (Figure 6C) were analyzed by fluorescence microscopy using the 
green channel for GFP. Figure 6D: The same sample as in Figure 6C imaged using the red 
channel for chlorophyll. Bars in (6A) and (6B) = 100 μm; bars in (6C) and (6D) = 10 μm. 

Figure 7 shows an SDS-PAGE of the purification of recombinant Arabidopsis 
mHY2. Lane 1, cell-free extract from uninduced Escherichia coli strain DH5α carrying 
pGEX-mHY2; lane 2, cell-free extract from isopropylthio-β-galactoside-induced Escherichia 
coli strain DH5α carrying pGEX-mHY2; lane 3, soluble fraction of the induction; lane 4, 
GST-mHY2 after glutathione-agarose affinity chromatography; lane 5, GST-mHY2 after 
PreScission protease treatment; lane 6, purified recombinant mHY2 after a second round of 
glutathione-agarose affinity chromatography; lane M, molecular mass standards. Numbers 
at right indicate positions of molecular weight markers (Sigma, SDS7) in kilodaltons. 

Figure 8 shows phytochrome difference spectra of Cph1 after incubation with 
BV metabolites. A soluble protein extract of isopropylthio-β-galactoside-induced E. coli 
DH5α carrying pGEX-mHY2 was assayed for PΦB synthase activity as described herein. 
Recombinant Cph1 was added to the bilin reaction mixture, which was incubated for 
another 30 min at room temperature under green safe light, and a phytochrome difference 
spectrum was obtained. The absorption maximum and minimum are indicated in 
nanometers. 

Figure 9 shows that HY2 converts BV to PΦB as detected by HPLC. 
Purified recombinant HY2 protein (40 μg) was assayed for PΦB synthase activity as 
described herein. Bilins were extracted from the incubation mixture using a SepPak C18 
reversed phase cartridge and analyzed by reversed phase HPLC as described herein. The 
HPLC solvent was acetone:20 mM formic acid (50:50 [v/v]), and the eluate was monitored 
at 380 nm. The top traces represent the standard bilins BV and 3E-PΦB. The third trace 
shows the bilin metabolites obtained after incubation of BV with HY2. The bottom trace 
has, in addition, BV as an internal standard. 

Figure 10 shows a multiple sequence alignment of the identified bilin 
reductases. All identified sequences were aligned using the programs CLUSTAL W and 
MEME. Conserved residues in 90 or 70% of the aligned sequences are depicted in the 
consensus sequence with uppercase or lowercase letters, respectively. Sequence similarity 
groups, labeled 1 (D, E), 2 (R, K), 3 (F, Y, W), and 4 (L, I, V, M), shown in the consensus 
sequence reflect conservation in >90% of the sequences. Dark shading with white letters, 
gray shading with white letters, and gray shading with black letters reflect 90, 70, and 50% 
sequence conservation, respectively. SYNY3, Synechocystis sp PCC6803; SYNPY, 
Synechococcus sp WH8020; SYN81, Synechococcus sp WH8102; PROMA, 
Prochlorococcus sp SS120; PROME, Prochlorococcus sp MED4; NOSPU, Nostoc 
punctiforme; ANASP, Anabaena sp PCC7120; ARATH, Arabidopsis thaliana; and 
HORVU, Hordeum vulgare. Database accession numbers are GB: AF339056 for 
PcyA_ANASP (CyanoBase contig 362), GB: AF339057 for PcyA_NOSPU (JGI contig 
632), PIR: S76709 for PcyA_SYNY3, PcyA_SYN81 is on JGI contig 51, GB: AF352050 
for PcyA_PROME (JGI contig 26), SW: Q02189 for PebA_SYNPY, PIR: S31075 
(fragment)/JGI contig 72 for PebA_SYN81, EMB: CAB95700.1 for PebA_PROMA, 
PebA_PROME is on JGI contig 26, GB: AF352049 for PebA_NOSPU (JGI contig 622), 
SW: Q02190 for PebB_SYNPY, PebB_SYN81 is on JGI contig 72, EMB: CAB95701.1 for 
PebB_PROMA, PebB_PROME is on JGI contig 26, GB: AF339058 for PebB_NOSPU 
(JGI contig 622), DDBJ: AB045112 for HY2_ARATH, EMB: CAB77705.1 for 
RCCR_HORVU, EMB: CAB16763.1 for RCCR_ARATH. Asterisks indicate every tenth 
amino acid; dashes indicate gaps; numbers above the line indicate amino acid sequence 
numbering starting with number one. 

Figure 11 shows a phylogenetic tree of the HY2 family of 
ferredoxin-dependent bilin reductases. 

Figure 12 shows an SDS-PAGE of affinity-purified bilin reductases. Lane 1, 
GST-PcyA_SYNY3 after glutathione agarose affinity chromatography; lane 2, purified 
recombinant PcyA_ANASP after a second round of glutathione agarose affinity 
chromatography; lane 3, mHY2; lane 4, PebA_SYNPY; lane 5, PebB_SYNPY. STD, 
molecular mass standards (in kilodaltons). 

Figures 13A and 13B show phytochrome difference spectra and phytofluor 
fluorescence spectra of recombinant cyanobacterial phytochrome (Cph1) incubated with 
reaction metabolites. Figure 13A: BV was incubated with a soluble protein extract of 
isopropyl-β-thiogalactopyranoside-induced E. coli DH5α strain carrying pGEX-NN under 
standard PΦB synthase assay conditions for 30 min at 28°C under green safe light. 
Recombinant apoCph1 was added to the reaction and incubated for an additional 30 min at 
room temperature under green safelight, and a phytochrome difference spectrum was 
obtained. The difference spectrum shown as a solid line was obtained with apoCph1 
incubated with PcyA_SYNY3 metabolites; the spectrum shown in dashed lines was 
obtained with mHY2 metabolites. The absorption maximum and minimum are indicated in 
nm. Neither PebA_SYNPY, PebB_SYNPY, nor a mixture of both was able to form a 
photoconvertible holophytochrome (no difference spectrum shown). Figure 13B: 
Phytofluor fluorescence spectra of recombinant cyanobacterial phytochrome (Cph1) 
incubated with PebA and PebB metabolites. The fluorescence excitation and emission 
spectra of the phytofluor were obtained after incubation of apoCph1 with the reaction 
metabolites of PebA_SYNPY and PebB_SYNPY. The solid line represents the excitation 
spectrum monitored with an emission wavelength of 590 nm. The dashed line shows the 
emission spectrum obtained with an excitation wavelength of 545 nm. 

Figure 14 shows HPLC analysis of the BV metabolites of PebA, PebB, 
PcyA, and HY2 bilin reductases. Forty micrograms of purified protein was incubated at 
28°C under green safelight in a total assay volume of 5 mL. The assay system contained an 
NADPH-regenerating system, spinach ferredoxin-NADP+ reductase, spinach ferredoxin, 
and BSA. The reaction was started by adding 5 μM BV and was stopped by placing the 
mixture on ice. Bilins were extracted from the incubation mixture using a SepPak C18 
reversed-phase column and analyzed by HPLC on a Phenomenex Ultracarb 5 μm ODS20 
4.6 mm × 250 mm column with a 4.6 mm × 30 mm guard column. The HPLC solvent was 
acetone:20 mM formic acid (50:50, v/v), and the effluent was monitored at 560 nm for the 
first 11.5 min and at 380 nm for the remaining time. STDS, mixture of different bilin 
standards; HY2, metabolites obtained by mHY2; PcyA, metabolites obtained by 
PcyA_SYNY3; PebA, metabolites obtained by PebA_SYNPY; PebB, metabolites obtained 
by PebB_SYNPY; PebA + PebB, metabolites obtained by a 1:1 mixture of PebA_SYNPY 
and PebB_SYNPY. Symbols are used for better visualization of peaks. Single symbols 
indicate the 3E isomer (except 15,16-DHBV and BV) and double symbols indicate the 3Z 
isomer. 

Figure 15 shows a holophytochrome difference spectrum taken of the protein 
purified from E. coli cells strain LMG194 induced to express apoCph1, HO1 and HY2. 

Figure 16 shows a comparison of holophytochrome difference spectra taken 
of the protein isolated from E. coli cells strain LMG194 induced to express apoCph1 with 
either HO1 and HY2, or HO1 and PcyA. 

DETAILED DESCRIPTION 

This invention pertains to the isolation and characterization of a family of 
bilin reductases (designated herein as the HY2 family). In certain embodiments, these bilin 
reductases catalyze the conversion of a biliverdin to a phytobilin and form a component of a 
phytochrome biosynthetic pathway. The bilin reductases of this invention can be used in 
vivo or in vitro simply to convert biliverdins to phytobilins or, in conjunction with other 
enzymes in the phytochrome synthetic pathway, to synthesize complete phytochromes 
and/or phytofluors. This invention also pertains to the recombinant synthesis of a 
phytochrome or phytofluor. 

The phytochrome chromophore biosynthetic pathway shown in Figure 1 has 
been elucidated by the classical approach of overcoming a blocked step with exogenous 
putative intermediates (Terry et al. (1993) Arch. Biochem. Biophys., 306:1-15) and by 
analysis of phytochrome chromophore-deficient mutants (Terry (1997) Plant Cell Environ., 
20: 740-745). This pathway shares common intermediates with heme and chlorophyll 
biosynthesis to the level of protoporphyrin IX, at which point the latter two pathways 
diverge by metallation with iron or magnesium (Beale (1993) Chemical Rev., 93:785-802; 
Porra (1997) Photochem. Photobiol., 65:492-516; Reinbothe and Reinbothe (1996) Eur. J. 
Biochem., 237:323-343). As shown in Figure 1, heme is converted to biliverdin IXα (BV), the 
first committed intermediate in the biosynthetic pathways of the chromophores of the 
phytochromes and of the light-harvesting phycobiliproteins in cyanobacteria, red algae and 
cryptophytes. This reaction is accomplished by a ferredoxin-dependent heme oxygenase in 
red algae and cyanobacteria (Rhie and Beale (1995) Arch. Biochem. Biophys., 320:182-194; 
Cornejo and Beale (1997) Photosynthesis Res., 51:223-230) and by an enzyme in plants that 
is likely to be similar in structure (Terry (1997) Plant Cell Environ., 20: 740-745). This 
contrasts with heme oxygenases found in mammalian systems that utilize cytochrome P450 
reductase for the oxygen-requiring conversion of heme to BV (Maines (1991) Annu. Rev. 
Pharmacol. Toxicol., 37: 517-554). 

As illustrated in Scheme 1 (Fig. 1), the metabolic fate of BV differs in green 
plants, cyanobacteria and mammals, with BV being metabolized by different reductases 
with unique double bond specificities. Mammalian biliverdin IXα reductase (BVR), an 
NAD(P)H-dependent enzyme that catalyzes the reduction at the C10 methine bridge to 
produce bilirubin (BR), was the first to be discovered (Singleton and Laster (1965) J. Biol. 
Chem., 240: 4780-4789). Mammalian BVRs are small soluble enzymes consisting of a 
single NAD(P)H and bilin binding subunit (Kutty and Maines (1981) J. Biol. Chem., 256: 
3956-3962; Maines and Trakshel (1993) Arch. Biochem. Biophys., 300: 320-326). Active 
recombinant versions of rat and human BVRs have been cloned and expressed in E. coli 
(Fakhrai and Maines (1992) J. Biol. Chem., 267: 4023-4029; McCoubrey and Maines (1994) 
Eur. J. Biochem., 222: 597-603; Maines et al. (1996) Eur. J. Biochem., 235: 372-381). The 
unexpected discovery of the gene bvdR in the cyanobacterium Synechocystis sp PCC 6803, 
which encodes a BVR that also catalyzes the NADPH-dependent reduction of the C10 
methine bridge of BV (Schluchter and Glazer (1997) J. Biol. Chem., 272: 13562-13569), 
has established that this enzyme has ancient evolutionary origins. Interestingly, bvdR plays 
a key role in the regulation of phycobiliprotein biosynthesis in this cyanobacterium since its 
inactivation leads to reduced accumulation of phycocyanin (Schluchter and Glazer (1997) J. 
Biol. Chem., 272: 13562-13569). 

Cyanobacteria possess additional bilin reductases for the synthesis of the 
linear tetrapyrrole precursors of their phycobiliprotein light-harvesting antennae complexes 
(Cornejo and Beale (1997) Photosynthesis Research, 51: 223-230). Based on this 
investigation and previous studies with the red alga Cyanidium caldarium (Beale and 
Cornejo (1991) J. Biol. Chem., 266: 22328-22332; Beale and Cornejo (1991) J. Biol. 
Chem., 266: 22333-22340; Beale and Cornejo (1991) J. Biol. Chem., 266: 22341-22345), 
Beale and colleagues have proposed that the biosynthesis of the phycobiliprotein 
chromophore precursors, phycoerythrobilin (PEB) and phycocyanobilin (PCB), involves 
two ferredoxin-dependent bilin reductases. The first of these enzymes catalyzes the 
reduction of BV at the C15 methine bridge to give 15,16-dihydrobiliverdin (i.e. DHBV 
synthase), while the second reduces 15,16-dihydrobiliverdin (DHBV) at the C2 double bond 
to produce 3Z-PEB (see Scheme 1). In Cyanidium, an additional enzyme appears to 
mediate the isomerization of 3Z-PEB to 3Z-PCB, both of which appear to be further 
isomerized to their corresponding 3E isomers prior to assembly with the nascent 
phycobiliprotein apoproteins. 

By contrast with mammals and phycobiliprotein-containing organisms, BV 
is reduced at the C2 double bond in plants and green algae to yield 3Z-PΦB by the 
ferredoxin-dependent enzyme PΦB synthase (Terry et al. (1995) J. Biol. Chem., 270: 
11111-11118; Wu et al. (1997) J. Biol. Chem., 272:25700-25705). In both higher and lower plants 
(e.g. mosses, ferns), 3Z-PΦB and/or its 3E-isomer have been established to be the 
immediate precursor of the phytochrome chromophore (Terry et al. (1993) Arch. Biochem. 
Biophys., 306: 1-15). Recent studies have established that PCB is the natural phytochrome 
chromophore precursor in the green alga Mesotaenium caldariorum, and both PΦB 
synthase and PΦB reductase, the enzyme that catalyzes the reductive conversion of 3Z-PΦB 
to 3Z-PCB, have been detected in soluble protein extracts from the chloroplast of this 
organism (Wu et al. (1997) J. Biol. Chem., 272:25700-25705). While a 3Z to 3E PΦB 
isomerase has been hypothesized, this enzyme has not been identified in plant extracts 
(Terry et al. (1993) Arch. Biochem. Biophys., 306: 1-15; Beale (1993) Chemical Rev., 
93:785-802). The final step of phytochrome chromophore biosynthesis is the covalent 
attachment of PΦB or PCB to apophytochrome. 

Biochemical analysis of known phytochrome chromophore-deficient 
mutants, which include the hy1 and hy2 mutants of A. thaliana (Koornneef et al. (1980) 
Zeitschrift für Pflanzenphysiologie, 100:147-160; Chory et al. (1989) Plant Cell, 1:867-880), 
the aurea and yg2 mutants of tomato (Koornneef et al. (1985) J. Plant Physiol., 120:153-165; 
Van Tuinen et al. (1996) Plant Journal, 9:173-182; Terry and Kendrick (1996) J. Biol. 
Chem., 271:21681-21686), and the pcd1 and pcd2 mutants of pea (Weller et al. (1996) Plant 
Cell, 8: 55-67; Weller et al. (1997) Plant J., 11: 1171-1186), supports the conclusion that 
these mutations reflect lesions in the structural genes for either heme oxygenase or PΦB 
synthase (reviewed in Terry (1997) Plant Cell Environ., 20: 740-745). Indeed, the HY1 
locus of Arabidopsis has been shown to encode a ferredoxin-dependent heme oxygenase. 

This invention pertains to the cloning and sequence analysis of HY2 and the 
demonstration that the HY2 locus encodes phytochromobilin synthase, a 
ferredoxin-dependent bilin reductase enzyme that converts BV to PΦB. In addition, it is demonstrated 
that protein relatives of HY2 are also biliverdin (BV) reductases. 

I. HY2 and HY2 family members. 

A) HY2. 

The genomic sequence of HY2 and the protein sequence are provided in 
Figure 3B. Based on cDNA sequence analysis, the HY2 protein contains 329 residues with 
a calculated molecular mass of 38.1 kD. At its N terminus, the HY2 protein sequence is 
rich in serine, with few acidic residues (six serine and one aspartic acid among 45 residues), 
which suggests a possible transit peptide for localization to plastids (Gravel and von Heijne 
(1990) FEBS Lett. 261: 455-458). The second amino acid after the initiation methionine is 
alanine, which is often observed in plastid transit peptides. 

The program CHLOROP was also used to predict the transit peptide of HY2, 
and it indicated that the first 45 amino acid residues of the HY2 protein form a chloroplast 
transit peptide (Emanuelsson et al. (1999) Protein Sci. 8: 978-984; 
http://www.cbs.dtu.dk/services/ChloroP/). 

The calculated molecular mass of the mature HY2 protein is 33.0 kD and its 
predicted pI is 5.66, which are similar to those of PΦB synthase purified from oat seedlings. 
The HY2 protein has no predicted transmembrane helices, which is also consistent with the 
observation that oat PΦB synthase is a soluble protein. 

B) The HY2 family and family members. 

Using the HY2 protein sequence as a query sequence, HY2 family members 
are identified using an iterative PSI-BLAST search of the nonredundant GenBank/EMBL 
database, e.g. using default search parameters (Altschul et al. (1997) Nucleic Acids Res. 25, 
3389-3402). No HY2-related gene was identified by this search in the nearly complete 
Arabidopsis genome. In contrast, this search identified HY2-related sequences from two 
marine cyanobacteria, Prochlorococcus marinus sp. SS120 (EMBL accession numbers 
CAB95700.1 and CAB95701.1) and Synechococcus sp. WH8020 (SWISS-PROT accession 
numbers Q02189 and Q02190), and a related protein sequence from the cyanobacterium 
Synechocystis sp. PCC 6803 (cyanobase locus slr0116; Protein Information Resource 
accession number S76709). 

Both marine cyanobacteria possess two HY2-related ORFs that appear to be 
part of multigene operons. The Synechococcus ORFs, ycp2_synpy and ycp3_synpy, are 
located within a cluster of genes involved in phycobiliprotein biosynthesis (Wilbanks and 
Glazer (1993) J. Biol. Chem. 268: 1226-1235), whereas the Prochlorococcus ORFs, which 
we term yhp2_proma and yhp3_proma, are located immediately downstream of a gene 
related to heme oxygenase (GB: AJ278499.1). These observations suggest that these genes 
are involved in phycobilin biosynthesis. 

Examination of highly conserved residues in the entire HY2 family and those 
within each of the five classes of bilin reductases provides information regarding residues 
important to the protein structure, ferredoxin interaction site, and substrate/product 
specificity. In this regard, only a small number of residues are conserved in the entire HY2 
family of enzymes. These include hydrophobic residues at positions 137, 157, 158, 256, 
and 314, Pro151, Phe221, Ser222, and Asp171 (Figure 10). The notable lack of conserved 
basic residues suggests that the propionyl moieties of the bilin substrates do not form salt 
linkages with the enzymes. The conserved hydrophobic residues proline and phenylalanine 
are likely to be involved in overall protein structure (i.e., folding). Alternatively, they may 
form hydrophobic interactions with conserved regions of the various bilin substrates. 

The loss-of-function hy2-1 and hy2-104 alleles of phytochromobilin synthase 
from Arabidopsis support the critical role of Pro151 in HY2's structure. The conserved 
serine and aspartate residues likely play catalytic roles, such as hydrogen bonding with the 
substrate and/or substrate protonation to make the bound bilin a better electron acceptor. 
Despite the wide divergence of the HY2 family, we believe that these conserved residues 
indicate that the active sites of all members of this class are similar. We believe the distinct 
double bond reduction specificities of the BV reductases (i.e., PcyA, PebA, HY2), the 
15,16-DHBV reductases (i.e., PebB), and the RCCR families reflect the positioning of the 
respective substrates within the catalytic pocket. Because the A/B and C/D rings of BV are 
very similar but not identical, it is conceivable that the substrate binding sites of the PebA 
and HY2 enzymes are tailored to position BV in opposite orientations, favoring electron 
transfer to the bilin C/D ring or A ring, respectively. If this is true, then the PebB class 
might tether its 15,16-DHBV substrate in an orientation similar to that of the HY2 family, 
whereas RCC might be bound to RCCR in a manner similar to that in which BV is bound to 
PebA. 

1) pcyA 

We have documented that the pcyA genes of the cyanobacteria Synechocystis 
sp PCC6803, Anabaena sp PCC7120, and Nostoc punctiforme encode bilin reductases that 
catalyze the four-electron reduction of BV to 3Z-PCB. PCB is the precursor of the 
chromophores of the phycobiliproteins phycocyanin and allophycocyanin, which are 
abundant in all three cyanobacteria. PcyA enzymes are atypical bilin reductases because all 
others catalyze two-electron reductions. Formally, these enzymes catalyze two-electron 
reductions of both the A and D rings of BV; however, we have not detected the production 
of semireduced intermediates such as PΦB and 18¹,18²-DHBV. Thus, it appears that the 
partially reduced intermediates are tightly bound to the enzyme. The direct conversion of 
BV to PCB in these cyanobacteria is in contrast to the proposed pathways of PCB 
biosynthesis in the red alga C. caldarium, which involves the intermediacy of PEB, and in 
the green alga M. caldariorum, in which 3Z-PΦB is an isolable intermediate. pcyA-related 
genes also are present in the oxyphotobacterium Prochlorococcus sp. MED4, an 
unanticipated observation in view of the lack of phycobiliproteins in this organism. We 
were able to clone the Prochlorococcus sp. MED4 pcyA gene and express it as an 
N-terminal GST fusion. We determined that recombinant PcyA_PROME was able to reduce 
BV to PCB in our standard phytochrome-based assay (data not shown). It therefore 
possesses the same enzymatic activity as other studied PcyA enzymes. 

2) pebA and pebB 

We have observed that the pebA and pebB genes of the cyanobacteria 
Synechococcus sp WH8020 and N. punctiforme encode bilin reductases that catalyze the 
conversions of BV to 15,16-DHBV and 15,16-DHBV to 3Z-PEB, respectively (Figure 1). 
PebA therefore is a 15,16-DHBV:ferredoxin oxidoreductase, whereas PebB is a 
3Z-PEB:ferredoxin oxidoreductase. Both activities are consistent with the pathway of PEB 
biosynthesis in the red alga C. caldarium. The two peb genes also are found in the same 
operon in both phycoerythrin-producing cyanobacteria, and their close association with the 
major phycobiliprotein gene clusters supports their role in phycobilin biosynthesis. 
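
The ferredoxin-dependent reactions described above can be summarized as a small reaction graph. The sketch below is illustrative only: the data structure and function names are hypothetical, the electron counts follow the statement that PcyA catalyzes a four-electron reduction while the other reductases catalyze two-electron reductions, and the putative isomerases are deliberately left out. "PphiB" is used as an ASCII stand-in for PΦB.

```python
from collections import deque

# (substrate, product): (enzyme, electrons transferred), as described in the text.
REACTIONS = {
    ("BV", "3Z-PphiB"): ("HY2", 2),
    ("BV", "3Z-PCB"): ("PcyA", 4),
    ("BV", "15,16-DHBV"): ("PebA", 2),
    ("15,16-DHBV", "3Z-PEB"): ("PebB", 2),
}

# Electrons transferred per enzyme, derived from the table above.
ELECTRONS = {enzyme: n for enzyme, n in REACTIONS.values()}


def enzymes_for(start, target):
    """Breadth-first search for the shortest enzyme route from start to target."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        compound, route = queue.popleft()
        if compound == target:
            return route
        for (substrate, product), (enzyme, _) in REACTIONS.items():
            if substrate == compound and product not in seen:
                seen.add(product)
                queue.append((product, route + [enzyme]))
    return None  # no route through the listed reductases


def total_electrons(route):
    """Sum of electrons transferred along a route of enzymes."""
    return sum(ELECTRONS[enzyme] for enzyme in route)
```

For instance, `enzymes_for("BV", "3Z-PEB")` returns the two-enzyme route PebA then PebB, whereas the route from BV to 3Z-PCB needs PcyA alone; both routes transfer four electrons in total.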

Without being bound to a particular theory, we believe PebA and PebB 
function as a dual enzyme complex, in view of the synergistic metabolism of BV observed 
when the two enzymes are coincubated. A peb operon is not present in the genome of the 
cyanobacterium Synechocystis sp PCC6803, an organism that lacks phycoerythrin. This 
strongly suggests that PCB is synthesized in this cyanobacterium via the PcyA-dependent 
pathway, as opposed to the PEB pathway found in C. caldarium. In this regard, 
biochemical analyses of crude extracts from Synechocystis sp PCC6803 provide no 
evidence for the production of PEB. The MED4 and SS120 subspecies of the 
oxyphotobacteria Prochlorococcus also possess peb operons very similar to those of 
Synechococcus sp WH8020 and WH8102, except that the former possess upstream genes 
related to heme oxygenase. This strongly suggests that both oxyphotobacterial subspecies 
can synthesize PEB. 

We also believe that Prochlorococcus PebA and PebB are likely functional 
orthologs of the Synechococcus and Nostoc enzymes. It is likely that numerous bilin 
isomerases are present in these oxygen-evolving photosynthetic organisms. 

C) Identification of other members of the HY2 family. 

Other members of the HY2 family of bilin reductases can readily be 
identified using the methods described herein (see e.g., Example 2). In preferred 
embodiments, such methods involve using alignment algorithms with one or more members 
of the HY2 family as described herein to search nucleic acid and/or protein databanks to 
identify related genes/polypeptides. 

The activity of the putative bilin reductase can be confirmed, e.g. using a 
standard bilin reductase activity assay. One such bilin reductase assay is described in detail 
in Examples 1 and 2. Basically, the putative bilin reductase is combined with a biliverdin in 
a buffer system compatible with enzyme activity. The assay mixture is incubated for a 
period of time. Product analysis can be accomplished using a direct HPLC assay (see 
Example 1) or by a coupled assay after the addition of an appropriate apophytochrome (e.g. 
recombinant cyanobacterial phytochrome such as Cph1) using spectroscopic methods. 
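
As a rough sketch of how the spectroscopic readout of the coupled assay might be processed, the fragment below computes a difference spectrum from two absorbance scans and reports the wavelengths of its maximum and minimum. It assumes the difference spectrum is the absorbance recorded after one irradiation minus that recorded after the other; the function names are hypothetical and the absorbance values are invented toy numbers, not measurements from this application.

```python
def difference_spectrum(wavelengths, abs_first, abs_second):
    """Return (wavelength, delta A) pairs, with delta A = abs_first - abs_second."""
    return [(w, a - b) for w, a, b in zip(wavelengths, abs_first, abs_second)]


def extrema(diff):
    """Wavelengths at which the difference spectrum is maximal and minimal."""
    w_max = max(diff, key=lambda point: point[1])[0]
    w_min = min(diff, key=lambda point: point[1])[0]
    return w_max, w_min


# Invented example scans (nm, absorbance units), for illustration only.
wavelengths = [600, 625, 650, 675, 700, 725, 750]
scan_a = [0.10, 0.20, 0.40, 0.25, 0.12, 0.06, 0.03]
scan_b = [0.09, 0.15, 0.22, 0.20, 0.30, 0.18, 0.05]

diff = difference_spectrum(wavelengths, scan_a, scan_b)
peak_nm, trough_nm = extrema(diff)
```

A photoconvertible holophytochrome gives a difference spectrum with a distinct maximum and minimum; a flat difference spectrum would indicate that no photoactive adduct formed.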

II. Uses for HY2. 

The HY2 family bilin reductases of this invention are useful tools for a variety of 
applications. The ability to engineer the biosynthesis of phycoerythrobilin (PEB) in any 
biliverdin-producing organism is now feasible via the introduction of one or two genes. Similarly, 
photoactive holophytochromes (e.g. bilin pigments bound to apophytochromes) can be 
produced in any ferredoxin-containing organism. 

Coexpression of bilin reductase genes with apophytochromes enables us to 
produce holophytochromes in a wide number of cell types including, but not limited to, algal 
cells, plant cells, bacteria, yeast, vertebrate cells (including mammalian cells), insect cells, 
and the like. This facilitates not only three-dimensional structural analysis of phytochrome, 
but also the reconstruction of phytochrome signaling in a non-plant system in which we can 
exploit the power of molecular genetic analyses. Recombinantly expressed phytochromes 
thus present an excellent model system useful for a wide variety of studies. Similar 
approaches have proven invaluable for the structure-function analysis of the steroid hormone 
receptor family. 

By introducing the pcyA gene into wild-type and chromophore-deficient 
mutant plants, it is possible to change the wavelength specificity of phytochrome, which can 
favorably alter plant growth and development in the field environment. Introduction of the 
pebA and pebB genes into plants can shunt the conversion of BV to PEB, yielding 
photomorphogenetically challenged plants with fluorescent phytochromes. This is 
especially useful for the analysis of the temporal and spatial patterns of phytochrome 
expression in plants. 

A) In vivo and ex vivo conversion of biliverdin to phytobilin. 

In certain embodiments, the HY2 family of bilin reductases of this invention 
can be used as simple reagents (reducing agents) to convert a biliverdin to a phytobilin. The 
enzymes can be used in vivo (e.g. in a plant), in vitro (e.g. in a cell culture), or ex vivo as a 
simple reagent. Thus, for example, one or more bilin reductases can be contacted with a 
biliverdin ex vivo in an appropriate buffer system (e.g., typically in the presence of a 
ferredoxin) resulting in the conversion of the biliverdin to a phytobilin (see, e.g., Example 
1). The phytobilin is then readily purified, e.g. using HPLC as described herein in the 
Examples. 

Alternatively, a host cell can be transfected with a nucleic acid encoding one 
or more bilin reductases of this invention and/or other components of the phytochrome 
biosynthetic pathway. The bilin reductases are expressed in the host cell where, in the 
presence of ferredoxin, they convert a biliverdin to a phytobilin. Such methods can be 
used simply to produce a phytobilin, or can be used to increase expression/production of a 
holophytochrome (e.g. by augmenting the phytochrome synthetic machinery already present 
in a plant cell, algal cell, photoactive bacterial cell, etc.) or a phytofluor. 

Preferred host cells are cells that natively provide a heme and/or a heme 
oxygenase and/or a ferredoxin. Various preferred cells include, in certain embodiments, 
cells that do not normally produce a phytochrome (e.g. certain bacterial cells, mammalian 
cells, etc.) and in certain other embodiments, cells that typically express phytochromes (e.g. 
plant cells, algal cells, etc.). 

B) Expression of holophytochromes. 

The bilin reductases, and other enzymes identified herein, can be used to 
assemble photoactive holophytochromes including photoactive chromophore precursors and 
fluorescent phytofluor chromophore precursors. It was a surprising discovery of this 
invention that a cell transfected with nucleic acids encoding the components of a bilin 
synthetic pathway (e.g., HO1, PcyA, and/or HY2) and a nucleic acid encoding an 
apophytochrome (e.g. Cph1, native and recombinant oat phytochrome A (ASPHYAST), 
Avena sativa phyA (AsphyA), Arabidopsis phyA (AtphyA), Mesotaenium caldariorum 
phy1b (Mcphy1b), Synechocystis sp. 6803 phy1 (S6803 phy1), and the like (see, U.S. Patent 
6,046,014)) will express a phytochromobilin that assembles with the apophytochrome to 
produce a photoactive holophytochrome (e.g. chromophore or phytofluor). 

The holophytochrome, whether chromophore or phytofluor, finds a number 
of uses. In one particularly preferred use, the chromophore or phytofluor is useful as a 
detectable label (e.g. a colorimetric or fluorescent label). Such labels are useful for the 
visualization, and/or localization and/or isolation of attached ligands. In particularly 
preferred embodiments, the apophytochrome is expressed as a fusion protein with a 
polypeptide that it is desired to label. The apophytochrome can be directly fused to the 
polypeptide or separated by a peptide linker. When the fusion protein is expressed, the 
apophytochrome component combines with the bilin to produce a chromophore or 
phytofluor which then acts as a label for the polypeptide. 

In particularly preferred embodiments, the holophytochrome is a phytofluor. 
Phytofluors are fluorescent apophytochrome-bilin conjugates (e.g., apophytochrome-PEB 
adducts) that are intensely fluorescent, photostable proteins useful as fluorescent labels 
(e.g. as probes for biological research, see, e.g., U.S. Patent 6,046,014). 

In certain embodiments the host cells are transfected with the pebA and pebB 
genes to shunt the conversion of biliverdin to PEB, yielding a fluorescent phytofluor. 

C) Heterologous holophytochromes as model systems. 

The methods of this invention can be used to express a holophytochrome in 
essentially any cell including cells that, in their native state, do not harbor a phytochrome. 
Cells containing recombinant holophytochrome provide model systems having a wide 
variety of uses. For example, such cells can be used to screen for agents that alter the 
activity and/or spectral sensitivity of the phytochrome. In such assays the cells are 
contacted with the agent(s) in question and then assayed for changes in physiological 
activity and/or changes in phytochrome localization or conformation and/or changes in 
spectral characteristics. 

Such model systems are also useful for dissecting the metabolic pathways in 
which phytochromes are involved. 

Recombinant holophytochromes of this invention can be introduced into 
plants, algae, and the like that normally harbor phytochromes as well. Such introduced 
heterologous phytochromes alter the wavelength specificity of the plant, which can favorably 
alter plant growth and development in the field environment. Using such methods the host 
range of various plants can be improved. 

III. Cloning and expression of HY2 proteins and other enzymes in the phytochrome 
biosynthetic pathway. 

It is often desirable to provide isolated ferredoxin-dependent bilin reductases 
(e.g. HY2 family members) and/or holophytochromes (e.g. chromophores or phytofluors) of 
this invention. These polypeptides and/or phytochromes can be used to raise an immune 
response and thereby generate antibodies specific to the phytochrome or to components of 
the biosynthetic system, which can then be used to localize and/or visualize such elements in 
cells. In addition, as indicated above, the isolated phytochromes can be coupled to various 
moieties and act as detectable labels. The enzyme components of the bilin synthetic 
pathway can be used as chemical reagents. 

As explained below, the holophytochromes and components of the 
phytochrome and/or bilin synthetic pathway can be conveniently produced using chemical 
synthesis or recombinant expression methodologies. In addition to the intact full-length 
polypeptides, in some embodiments, it is often desirable to express immunogenically 
relevant fragments (e.g. fragments that can be used to raise specific antibodies). 

A) De novo chemical synthesis. 

The phytochrome pathway components and/or apophytochromes, the active 
bilin lyase domain, or other subsequences can be synthesized using standard chemical 
peptide synthesis techniques. Where the desired subsequences are relatively short (e.g., 
when a particular antigenic determinant is desired) the molecule may be synthesized as a 
single contiguous polypeptide. Where larger molecules are desired, subsequences can be 
synthesized separately (in one or more units) and then fused by condensation of the amino 
terminus of one molecule with the carboxyl terminus of the other molecule, thereby forming 
a peptide bond. 

Solid phase synthesis in which the C-terminal amino acid of the sequence is 
attached to an insoluble support followed by sequential addition of the remaining amino 
acids in the sequence is the preferred method for the chemical synthesis of the polypeptides 
of this invention. Techniques for solid phase synthesis are described by Barany and 
Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: Analysis, Synthesis, 
Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A., Merrifield, et al. (1963) J. 
Am. Chem. Soc., 85: 2149-2156, and Stewart et al. (1984) Solid Phase Peptide Synthesis, 
2nd ed. Pierce Chem. Co., Rockford, Ill. 

B) Recombinant expression. 

In a preferred embodiment, the holophytochromes of this invention and/or 
components of the phytochrome synthetic pathway (e.g. HY2 family reductases) are 
synthesized using recombinant expression systems. Generally this involves creating a DNA 
sequence that encodes the desired protein(s), placing the DNA in an expression cassette 
under the control of a particular promoter, expressing the protein in a host, and, if desired, 
isolating the expressed protein. 

1) Nucleic acids. 

Using the information provided herein (e.g. HY2 family member sequences, 
primers, etc.), the nucleic acids (e.g., encoding apoproteins, HY2 family reductases, and the 
like) can be prepared using standard methods known to those of skill in the art. For 
example, the HY2 family nucleic acid(s) may be cloned, or amplified by in vitro methods, 
such as the polymerase chain reaction (PCR), the ligase chain reaction (LCR), the 
transcription-based amplification system (TAS), the self-sustained sequence replication 
system (SSR), etc. A wide variety of cloning and in vitro amplification methodologies are 
well-known to persons of skill. Examples of techniques sufficient to direct persons of skill 
through in vitro amplification methods are found in Berger, Sambrook, and Ausubel, as well 
as Mullis et al., (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to Methods and 
Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis); Arnheim 
& Levinson (October 1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3: 81-94; 
Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. 
Natl. Acad. Sci. USA 87: 1874; Lomell et al. (1989) J. Clin. Chem., 35: 1826; Landegren et 
al., (1988) Science, 241: 1077-1080; Van Brunt (1990) Biotechnology, 8: 291-294; Wu and 
Wallace, (1989) Gene, 4: 560; and Barringer et al. (1990) Gene, 89: 117. 

DNA encoding desired proteins (e.g. HY2 family members) described herein 
can be prepared by any suitable method as described above, including, for example, cloning 
and restriction of appropriate sequences or direct chemical synthesis by methods such as the 
phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the 
phosphodiester method of Brown et al. (1979) Meth. Enzymol. 68: 109-151; the 
diethylphosphoramidite method of Beaucage et al. (1981) Tetra. Lett., 22: 1859-1862; and 
the solid support method of U.S. Patent No. 4,458,066. 

Chemical synthesis produces a single stranded oligonucleotide. This may be 
converted into double stranded DNA by hybridization with a complementary sequence, or 
by polymerization with a DNA polymerase using the single strand as a template. One of 
skill would recognize that while chemical synthesis of DNA is limited to sequences of about 
100 bases, longer sequences may be obtained by the ligation of shorter sequences. 
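
The length limit and the complementary-strand step described above can be sketched in a few lines of Python. This is a minimal illustration only, not part of the disclosed methods: the function names, the default limit of 100 bases, and the 240-base target sequence are all assumptions made for the example, and a practical design would also give adjacent fragments overlapping or complementary ends so that they can be annealed and ligated in order.

```python
# Illustrative sketch only; names, the 100-base default, and the target
# sequence are hypothetical and not taken from the specification.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def split_for_synthesis(seq: str, max_len: int = 100) -> list[str]:
    """Split a sequence into consecutive fragments of at most max_len bases."""
    if max_len < 1:
        raise ValueError("max_len must be positive")
    return [seq[i:i + max_len] for i in range(0, len(seq), max_len)]


target = "ATGC" * 60  # hypothetical 240-base sequence
fragments = split_for_synthesis(target)

# Joining the fragments in order (the "ligation" step) restores the target.
assert "".join(fragments) == target
print([len(f) for f in fragments])  # [100, 100, 40]

# The complementary strand of each fragment, as would be hybridized to it.
complements = [reverse_complement(f) for f in fragments]
```

Splitting on fixed boundaries keeps the sketch short; choosing boundaries that avoid repeats or secondary structure is left out deliberately.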

Alternatively, subsequences may be cloned and the appropriate subsequences 
cleaved using appropriate restriction enzymes. The fragments may then be ligated to 
produce the desired DNA sequence. 

In one embodiment, the nucleic acids of this invention can be cloned using 
DNA amplification methods such as polymerase chain reaction (PCR) (see, e.g., Example 
1). Thus, for example, the nucleic acid sequence or subsequence is PCR amplified, using a 
sense primer containing one restriction site (e.g., NdeI) and an antisense primer containing 
another restriction site (e.g., HindIII). This will produce a nucleic acid encoding the desired 
sequence (e.g. HY2 sequence) or subsequence and having terminal restriction sites. This 
nucleic acid can then be easily ligated into a vector containing a nucleic acid encoding the 
second molecule and having the appropriate corresponding restriction sites. Suitable PCR 
primers can be determined by one of skill in the art using the sequence information, and 
representative primers are provided herein. Appropriate restriction sites can also be added 
to the nucleic acid encoding the desired protein or protein subsequence by site-directed 
mutagenesis. The plasmid containing the desired sequence or subsequence (e.g. HY2 bilin 
reductase sequence) is cleaved with the appropriate restriction endonuclease and then 
ligated into the vector encoding the second molecule according to standard methods. 
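
The primer arrangement described above (a sense primer carrying one restriction site and an antisense primer carrying another) can be illustrated with a short Python sketch. The recognition sequences for NdeI (CATATG) and HindIII (AAGCTT) are the standard ones; everything else is an assumption made for the example, including the function names, the 20-base annealing length, the two-base 5' clamp, and the template sequence, none of which come from the Examples. The sketch does not check melting temperature, reading frame, or internal occurrences of the sites.

```python
# Illustrative sketch only; the template, annealing length, and 5' clamp are
# hypothetical choices, not the primers used in the Examples.
NDEI = "CATATG"     # NdeI recognition sequence
HINDIII = "AAGCTT"  # HindIII recognition sequence

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(template: str, anneal_len: int = 20, clamp: str = "GC"):
    """Build a sense primer with an NdeI site and an antisense primer with a
    HindIII site, each followed by a template-matching annealing region."""
    template = template.upper()
    if len(template) < anneal_len:
        raise ValueError("template is shorter than the annealing length")
    sense = clamp + NDEI + template[:anneal_len]
    antisense = clamp + HINDIII + reverse_complement(template[-anneal_len:])
    return sense, antisense


# Hypothetical coding sequence used only to exercise the function.
template = "ATGGCTAGCTTAGGCTAACGTTAGCCGATCGATTACGGATCCGTAGCTAGGCTAA"
sense_primer, antisense_primer = design_primers(template)
print(sense_primer)
print(antisense_primer)
```

The amplified product would then carry an NdeI site at one end and a HindIII site at the other, which is what allows directional ligation into a vector cut with the same pair of enzymes.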

The nucleic acid sequences encoding desired protein or protein subsequences 
may be expressed in a variety of host cells, including E. coli, other bacterial hosts, yeast, 
and various higher eukaryotic cells such as the COS, CHO and HeLa cell lines and 
myeloma cell lines. The recombinant protein gene will be operably linked to appropriate 
expression control sequences for each host. For E. coli this includes a promoter such as the 
T7, trp, or lambda promoters, a ribosome binding site and preferably a transcription 
termination signal. For eukaryotic cells, the control sequences will include a promoter and 
often an enhancer (e.g., an enhancer derived from immunoglobulin genes, SV40, 
cytomegalovirus, etc.), and a polyadenylation sequence, and may include splice donor and 
acceptor sequences. 

The isolation and expression of an HY2 nucleic acid is illustrated in 
Examples 1 and 2. 
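
The host-specific control sequences listed above can be summarized as a small lookup, sketched here in Python. This is only a reading aid under stated assumptions: the dictionary keys, element names, and function name are hypothetical labels, and treating the promoter and ribosome binding site (for E. coli) or the promoter and polyadenylation sequence (for eukaryotic cells) as the minimum set is an interpretation of the passage, in which the transcription terminator is "preferably" present, an enhancer is "often" present, and splice sites "may" be included.

```python
# Illustrative sketch only; element names and the required/optional split are
# an interpretation of the passage above, not a definitive rule.
REQUIRED = {
    "E. coli": {"promoter", "ribosome binding site"},
    "eukaryotic": {"promoter", "polyadenylation sequence"},
}
OPTIONAL = {
    "E. coli": {"transcription termination signal"},
    "eukaryotic": {"enhancer", "splice donor", "splice acceptor"},
}


def missing_elements(host: str, cassette: set[str]) -> set[str]:
    """Return the control elements a cassette still lacks for the given host."""
    if host not in REQUIRED:
        raise KeyError(f"no control-sequence list recorded for host {host!r}")
    return REQUIRED[host] - cassette


# A hypothetical cassette with a promoter and terminator but no ribosome binding site.
cassette = {"promoter", "transcription termination signal"}
print(missing_elements("E. coli", cassette))  # {'ribosome binding site'}
```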

2) Expression Vectors 

A nucleic acid of the invention encoding one or more enzymes of a 
phytochrome biosynthetic pathway, e.g., as described above, can be incorporated into a 
recombinant expression vector in a form suitable for expression of the enzyme(s) (and in 
certain embodiments, assembly of a holophytochrome) in a host cell. The term "in a form 
suitable for expression of the fusion protein in a host cell" is intended to mean that the 
recombinant expression vector includes one or more regulatory sequences operably linked 
to the nucleic acid encoding the enzyme(s) in a manner that allows for transcription of the 
nucleic acid into mRNA and translation of the mRNA into the subject protein(s). The term 
"regulatory sequence" is art-recognized and intended to include promoters, and/or enhancers 
and/or other expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are known to those skilled in the art (see, e.g., Goeddel (1990) Gene Expression 
Technology: Meth. in Enzymol. 185, Academic Press, San Diego, CA; Berger and Kimmel, 
Guide to Molecular Cloning Techniques, Methods in Enzymology 152 Academic Press, Inc., 
San Diego, CA; Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual (2nd 
ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, NY, etc.). 

The design of the expression vector may depend on such factors as the 
choice of the host cell to be transfected and/or particular protein(s) to be expressed. When 
used in mammalian cells, a recombinant expression vector's control functions are often 
provided by viral genetic material. Preferred promoters include, but are not limited to CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein-I. Use of appropriate regulatory elements can allow for high level 
expression of the polypeptide(s) in a variety of host cells. A number of suitable expression 
systems are commercially available. In one preferred embodiment, the sequences encoding 
the desired polypeptide(s) are expressed in the TA cloning plasmid, pCR2.1 (Invitrogen), e.g. 
as described in Example 3. 

It will be appreciated that desired polypeptides can be operably linked to 
constitutive promoters for high level, continuous expression. Alternatively, inducible 
and/or tissue-specific promoters can be utilized. 

In one embodiment, the recombinant expression vector of the invention is a 
plasmid or cosmid. Alternatively, a recombinant expression vector of the invention can be a 
virus, or portion thereof, which allows for expression of a nucleic acid introduced into the 
viral nucleic acid. For example, replication defective retroviruses, adenoviruses and 
adeno-associated viruses can be used. 

Examples of techniques and instructions sufficient to direct persons of skill 
through cloning procedures are found in Berger and Kimmel, Guide to Molecular Cloning 
Techniques, Methods in Enzymology 152 Academic Press, Inc., San Diego, CA; Sambrook 
et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor Press, NY; Ausubel et al. (1994) Current Protocols 
in Molecular Biology, Current Protocols, a joint venture between Greene Publishing 
Associates, Inc. and John Wiley & Sons, Inc.; U.S. patent number 5,017,478; and European 
Patent No. 0,246,864. 

3) Host cells. 

The holophytochromes and/or components of the phytochrome biosynthetic 
pathway can be expressed in virtually any cell. Preferred cells, however, comprise 
endogenous heme and/or a ferredoxin or are modified to comprise a heme and/or a 
ferredoxin. Particularly preferred cells include, but are not limited to algal cells, bacterial 
cells, yeast cells, plant cells, vertebrate cells, and mammalian cells including human cells. 

The holophytochromes and/or components of the phytochrome biosynthetic 
pathway are expressed in a host cell by introducing nucleic acid encoding the subject 
polypeptide(s) into the host cell, wherein the nucleic acid is in a form suitable for 
expression of the subject polypeptide(s) in the host cell. For example, a recombinant 
expression vector of the invention, encoding the subject polypeptide(s), is introduced into a 
host cell. Alternatively, nucleic acid encoding the subject polypeptide(s) which is 
operatively linked to regulatory sequences (e.g., promoter sequences) but without additional 
vector sequences can be introduced into a host cell. 

As used herein, the term "host cell" is intended to include any cell or cell line 
so long as the cell or cell line is not incompatible with the protein(s) to be expressed, the 
selection system chosen or the culture system employed. As indicated above, suitable cells 
include, but are not limited to algal cells, bacterial cells (e.g. E. coli), yeast cells (e.g., S. 
cerevisiae, S. pombe, P. pastoris, K. lactis, H. polymorpha, see, e.g., Fleer (1992) Curr. 
Opin. Biotech. 3(5): 486-496), fungal cells, plant cells (e.g. Arabidopsis), invertebrate cells 
(e.g. insect cells) and vertebrate cells including mammalian cells. Non-limiting examples of 
mammalian cell lines which can be used include CHO cells (Urlaub and Chasin (1980) 
Proc. Natl. Acad. Sci. USA 77: 4216-4220), 293 cells (Graham et al. (1977) J. Gen. Virol. 
36: 59), or myeloma cells (e.g., SP2 or NS0, see Galfre and Milstein (1981) Meth. 
Enzymol. 73(B): 3-46), and the like. 

Examples of vectors for expression in yeast S. cerevisiae include, but are not 
limited to pYepSec1 (Baldari et al. (1987) EMBO J. 6: 229-234), pMFa (Kurjan and 
Herskowitz, (1982) Cell 30: 933-943), pJRY88 (Schultz et al., (1987) Gene 54: 113-123), 
and pYES2 (Invitrogen Corporation, San Diego, Calif.). The desired polypeptides can be 
expressed in insect cells (e.g., SF9 cells) using baculovirus expression vectors (see, e.g., 
O'Reilly et al. (1992) Baculovirus Expression Vectors: A Laboratory Manual, Stockton 
Press). 

4) Introduction of nucleic acid into a host cell. 

Nucleic acid(s) encoding the apophytochrome and/or components of the bilin 
biosynthetic pathway can be introduced into a host cell by standard techniques for 
transfecting cells. The term "transfecting" or "transfection" is intended to encompass all 
conventional techniques for introducing nucleic acid into host cells, including calcium 
phosphate co-precipitation, DEAE-dextran-mediated transfection, lipofection, 
electroporation and microinjection. Suitable methods for transfecting host cells can be 
found in Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd Edition, 
Cold Spring Harbor Laboratory Press, and other laboratory textbooks. 

The number of host cells transformed with a nucleic acid of the invention 
will depend, at least in part, upon the type of recombinant expression vector used and the 
type of transfection technique used. Nucleic acid can be introduced into a host cell 
transiently, or for long term. In long-term systems, the nucleic acid is stably integrated into 
the genome of the host cell or remains as a stable episome in the host cell. 

Certain vectors integrate into host cells at only a low frequency. In order to 
identify these integrants, a gene that contains a selectable marker (e.g., drug resistance) is 
generally introduced into the host cells along with the nucleic acid of interest. Preferred 
selectable markers include those which confer resistance to certain drugs, such as G418 and 
hygromycin. Selectable markers can be introduced on a separate plasmid from the nucleic 
acid of interest or on the same plasmid. Host cells transfected with a nucleic 
acid of the invention (e.g., a recombinant expression vector) and a gene for a selectable 
marker can be identified by selecting for cells using the selectable marker. For example, if 
the selectable marker encodes a gene conferring neomycin resistance, host cells which have 
taken up nucleic acid can be selected with G418. Cells that have incorporated the selectable 
marker gene will survive, while the other cells die. 

Nucleic acid encoding the polypeptides of the invention can be introduced 
into cells growing in culture in vitro by conventional transfection techniques (e.g., calcium 
phosphate precipitation, DEAE-dextran transfection, electroporation, biolistics, etc.). 
Nucleic acid can also be transferred into cells in vivo, for example by application of a 
delivery mechanism suitable for introduction of nucleic acid into cells in vivo, such as 
retroviral vectors (see e.g., Ferry et al. (1991) Proc. Natl. Acad. Sci. USA, 88: 8377-8381; 
and Kay et al. (1992) Human Gene Therapy 3: 641-647), adenoviral vectors (see, e.g., 
Rosenfeld (1992) Cell 68: 143-155; and Herz and Gerard (1993) Proc. Natl. Acad. Sci. 
USA, 90: 2812-2816), receptor-mediated DNA uptake (see e.g., Wu and Wu (1988) J. Biol. 
Chem. 263: 14621; Wilson et al. (1992) J. Biol. Chem. 267: 963-967; and U.S. Pat. No. 
5,166,320), direct injection of DNA (see, e.g., Acsadi et al. (1991) Nature 332: 815-818; 
and Wolff et al. (1990) Science 247: 1465-1468) or particle bombardment (biolistics) (see 
e.g., Cheng et al. (1993) Proc. Natl. Acad. Sci. USA, 90: 4455-4459; and Zelenin et al. 
(1993) FEBS Letts. 315: 29-32). 

5) Recovery of expressed polypeptide or holophytochrome. 

In some instances, it is desired to recover the expressed polypeptide (e.g. 
HY2 family reductase) and/or the assembled holophytochrome or the holophytochrome 
labeled polypeptide. Once expressed, the desired proteins and/or holophytochromes can be 
purified according to standard procedures of the art, including ammonium sulfate 
precipitation, affinity columns, column chromatography, gel electrophoresis and the like 
(see, generally, R. Scopes, (1982) Protein Purification, Springer-Verlag, N.Y.; Deutscher 
(1990) Methods in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, 
Inc. N.Y.). In certain embodiments, substantially pure compositions of at least about 90 to 
95% homogeneity are preferred, and 98 to 99% or more homogeneity are most preferred. 
The cloning and expression of HY2 family members is illustrated in Examples 1 and 2, 
and the expression of a holophytochrome is illustrated in Example 3. 

One of skill would recognize that modifications can be made to the 
apophytochrome and/or the HY2 reductases (or other components of the phytochrome 
synthetic pathway) without diminishing their biological activity. Some modifications may 
be made to facilitate the cloning and expression of the subject molecule(s). Such 
modifications are well known to those of skill in the art and include, for example, a 
methionine added at the amino terminus to provide an initiation site, or additional amino 
acids (e.g., poly His) placed on either terminus to create conveniently located restriction 
sites or termination codons or purification sequences. 

IV. Assembly of phytochromes and phytofluors. 

In certain preferred embodiments, this invention provides for the assembly of 
holophytochromes. It was a surprising discovery of this invention that a cell transfected 
with nucleic acids encoding the components of a bilin synthetic pathway (e.g., HO1, PcyA, 
and/or HY2) and a nucleic acid encoding an apophytochrome (e.g. Cph1) will express a 
phytochromobilin that assembles with the apophytochrome to produce a photoactive 
holophytochrome. 

It has been demonstrated that recombinant apophytochromes produced in 
microorganisms can self assemble with the bilins phycocyanobilin, phytochromobilin and 
phycoerythrobilin to produce photoreversible holophytochromes and intensely fluorescent 
phytofluors in vitro (Wahleithner et al. (1991) Proc. Natl. Acad. Sci. USA, 88: 10387-
10391; Li and Lagarias (1992) J. Biol. Chem., 267: 19204-19210; Murphy and Lagarias 
(1997) Curr. Biol. 7: 870-876; US Patent 6,046,014). 

This invention additionally provides the genes encoding ferredoxin- 
dependent bilin reductases that convert biliverdin to phytochromobilin, phycocyanobilin or 
phycoerythrobilin (see Figure 1). In one aspect, this invention describes an in vivo 
expression system for holophytochrome. 

One preferred approach involved the design of a synthetic operon comprising 
HO1 and PcyA coding regions from Synechocystis sp. PCC6803 (Yanofsky et al. (1981) 
Nucl. Acids Res., 9: 6647-6667; Baneyx (1999) Curr. Opin. Biotechnology, 10: 411-421; 
Yeh, et al. (1997) Science, 277: 1505-1508). Cloning of ho1 and pcyA genes from 
Synechocystis sp. PCC6803 in the plasmid pPROLarA122 (Clontech Laboratories) places 
these genes under the control of a dual Ara/Lac promoter. Upon introduction of this plasmid 
into E. coli cells harboring the Cph1-expression plasmid, pBAD/Cph1(514), photoactive 
holophytochrome is expressed/assembled in vivo. 

Another particularly preferred approach, illustrated in Example 3, involved 
the production of a synthetic operon comprised of HO1 from Synechocystis sp. PCC6803 
and the mature HY2 coding region (mHY2) from Arabidopsis thaliana that lacks the plastid 
targeting sequence. The cloning of HO1 and mHY2 open reading frames into the plasmid 
pPROLarA122 (Clontech Laboratories) placed this operon under regulatory control of a 
dual Ara/Lac promoter. Upon introduction of this plasmid into E. coli cells harboring the 
Cph1-expression plasmid, pBAD/Cph1(514), in which Cph1(N514) is under regulatory 
control of an Ara promoter, the production of photoactive holophytochrome in vivo was 
observed. 

These approaches are illustrative and not meant to be limiting. Using the 
teaching provided herein, numerous other approaches will be available to one of skill in the 
art. For example, cell lines naturally harboring HO1 can be used, thereby eliminating the 
need to provide this enzyme from a heterologous nucleic acid. Alternatively, the cell can be 
provided exogenous biliverdin using a variety of transfection reagents (e.g. cationic 
lipids, Lipofectamine™, Chariot™, etc.). 

Other bilin reductases can be used, numerous apoproteins or minimal 
domains thereof sufficient to form holophytochromes and/or phytofluors can be used, and 
other components of the bilin biosynthetic pathway can be provided by heterologous nucleic 
acids. For example, cells expressing an apophytochrome, HO1, pebA and pebB will 
produce phytofluors in vivo. Similarly, co-expression of the structural gene for a 
phycobiliprotein, a phycobiliprotein bilin lyase and the genes necessary for a phytobilin 
biosynthetic pathway from heme will lead to the production of fluorescent phycobiliproteins 
in living cells. 

V. Kits. 

This invention also provides kits for the practice of the methods of this 
invention. In one embodiment the kits include a container containing one or more bilin 
reductases of this invention (e.g. HY2 family members) and/or nucleic acids encoding one 
or more bilin reductases of this invention. In certain embodiments, the kits comprise a 
container containing nucleic acids sufficient to express and assemble a holophytochrome (e.g. 
a bilin chromophore or a phytofluor) in a host cell. Such kits optionally include a vector 
encoding an apoprotein and, optionally, a restriction site to insert a nucleic acid into the 
vector so the heterologous nucleic acid expresses a fusion protein with the apoprotein. 

The kits may optionally include devices and reagents to facilitate performing 
the methods of this invention. Such devices and reagents include, but are not limited to 
microtiter plates (e.g. for high-throughput applications), culture plates, culture media, cell 
lines, buffers, labels, and the like. 

In addition, the kits optionally include labeling and/or instructional materials 
providing directions (i.e., protocols) for the practice of the methods of this invention. 
Preferred instructional materials describe the expression of a bilin reductase and/or the in 
vivo expression/assembly of a holophytochrome and/or a phytofluor and/or the expression 
of a polypeptide labeled (as a fusion protein) with a holophytochrome or a phytofluor. 

While the instructional materials typically comprise written or printed 
materials they are not limited to such. Any medium capable of storing such instructions and 
communicating them to an end user is contemplated by this invention. Such media include, 
but are not limited to electronic storage media (e.g., magnetic discs, tapes, cartridges, 
chips), optical media (e.g., CD ROM), and the like. Such media may include addresses to 
internet sites that provide such instructional materials. 

EXAMPLES 

The following examples are offered to illustrate, but not to limit the claimed 
invention. 

Example 1 

The Arabidopsis HY2 Gene Encodes Phytochromobilin Synthase, a Ferredoxin-Dependent 
Biliverdin Reductase 

Light perception by the plant photoreceptor phytochrome requires the 
tetrapyrrole chromophore phytochromobilin (PΦB), which is covalently attached to a large 
apoprotein. Arabidopsis mutants hy1 and hy2, which are defective in PΦB biosynthesis, 
display altered responses to light due to a deficiency in photoactive phytochrome. In this 
example, we describe the isolation of the HY2 gene by map-based cloning. hy2 mutant 
alleles possess alterations within this locus, some of which affect the expression of the HY2 
transcript. HY2 encodes a soluble protein precursor of 38 kD with a putative N-terminal 
plastid transit peptide. The HY2 transit peptide is sufficient to localize the reporter green 
fluorescent protein to plastids. Purified mature recombinant HY2 protein exhibits PΦB 
synthase activity (i.e., ferredoxin-dependent reduction of biliverdin IXα to PΦB), as 
confirmed by HPLC and by the ability of the bilin reaction products to combine with 
apophytochrome to yield photoactive holophytochrome. Database searches and 
hybridization studies suggest that HY2 is a unique gene in the Arabidopsis genome that is 
related to a family of proteins found in oxygenic photosynthetic bacteria. 

Introduction. 

Plants are exquisitely sensitive to their environment. Because they are 
sessile and use light as the energy source for photosynthesis, plants have developed well-
refined photoreception and signaling systems to modulate their growth and development. 
The family of phytochromes, which are sensory photoreceptors for red and far red light, 
play a key role in mediating responses to light quality, quantity, direction, and duration 
throughout plant development (Kendrick and Kronenberg (1994) Photomorphogenesis in 
Plants. (Dordrecht, The Netherlands: Martinus Nijhoff Publishers); Quail et al. (1995) 
Science 263: 675-680; Furuya and Schafer (1996) Trends Plant Sci. 1: 301-307; Neff et al. 
(2000) Genes Dev. 14: 257-271). Plant phytochromes are homodimers composed of ~125-
kD subunits each with a thioether-linked phytochromobilin (PΦB) prosthetic group 
(Lagarias and Rapoport (1980) J. Am. Chem. Soc. 102: 4821-4828). Phytochrome action 
depends on its ability to photointerconvert between the red light-absorbing form and the 
far-red-light absorbing form, a property conferred by covalently bound PΦB in 
holophytochrome. 

Two pathways are involved in the biosynthesis of holophytochrome, one for 
the apoprotein, which is encoded by a small multigene family (e.g., PHYA-E in 
Arabidopsis) (Sharrock and Quail (1989) Genes Dev. 3: 1745-1757; Clack et al. (1994) 
Plant Mol. Biol. 25: 413-427), and another for the synthesis of the PΦB (Terry et al. (1993) 
Arch. Biochem. Biophys. 306: 1-15). Apophytochrome is synthesized in the cytosol, 
whereas PΦB is synthesized entirely within the plastid compartment, followed by its release 
to the cytosol, where holophytochrome assembly occurs (Figure 2). Based on spectroscopic 
studies of purified phytochromes, in vitro bilin assembly studies with recombinant 
apophytochromes, and physiological analyses of chromophore-deficient mutants, PΦB 
appears to be the immediate chromophore precursor of all higher plant and cryptophyte 
phytochromes (Terry et al. (1993) Arch. Biochem. Biophys. 306: 1-15; Terry (1997) Plant 
Cell Environ. 20: 740-745). 

PΦB is synthesized from 5-aminolevulinic acid and shares many
intermediates with the pathways of chlorophyll and heme biosynthesis (Elich and Lagarias
(1987) Plant Physiol. 84: 304-310; Elich et al. (1989) J. Biol. Chem. 264: 183-189). These
analyses established that biliverdin IXα (BV) is a PΦB precursor, suggesting the
intermediacy of heme in the phytochrome chromophore biosynthetic pathway. Indeed, the
first committed step of PΦB biosynthesis is catalyzed by a ferredoxin-dependent heme
oxygenase, which is encoded by the HY1 gene in Arabidopsis and by its ortholog in rice
(Davis et al. (1999) Proc. Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et al. (1999)
Plant Cell 11: 335-347; Izawa et al. (2000) Plant J. 22: 391-399). Ferredoxin-dependent
heme oxygenases were first identified in red algae and cyanobacteria, in which they
catalyze the oxygen-dependent conversion of heme to BV (Beale and Cornejo (1984) Arch.
Biochem. Biophys. 235: 371-384; Cornejo and Beale (1988) J. Biol. Chem. 263: 11915-11921;
Cornejo and Beale (1997) Photosynth. Res. 51: 223-230; Cornejo et al. (1998) Plant
J. 15: 99-107). BV, therefore, is the first committed intermediate in the biosynthetic
pathways of PΦB as well as those of the phycobilins phycocyanobilin and
phycoerythrobilin, which are precursors of the light-harvesting prosthetic groups of the
phycobiliproteins in cyanobacteria, red algae, and cryptomonads (Beale (1993) Chem. Rev.
93: 785-802).

In plants, BV is subsequently reduced to 3Z-PΦB by the ferredoxin-dependent
bilin reductase PΦB synthase, which has not yet been cloned (Terry and
Lagarias (1991) J. Biol. Chem. 266: 22215-22221). Although 3Z-PΦB can serve as a
functional precursor of the phytochrome chromophore, its facile isomerization to 3E-PΦB,
which is also a precursor of the phytochrome chromophore, likely occurs in plants (Terry et
al. (1995) J. Biol. Chem. 270: 11111-11118). Ferredoxin-dependent bilin reductases are
also present in cyanobacteria and red algae, where they catalyze the conversion of BV to the
phycobilins (reviewed by Beale (1993) Chem. Rev. 93: 785-802). None of these bilin
reductases has previously been cloned.

Our understanding of photomorphogenesis in plants has been aided greatly
by the isolation of five classic photomorphogenic Arabidopsis mutants (hy1 to hy5) that are
impaired in response to light (Koornneef et al. (1980) Z. Pflanzenphysiol. 100: 147-160).
Photoreceptor-deficient mutants have proven to be powerful tools to analyze which
photoreceptors mediate specific photomorphogenetic responses (Koornneef and Kendrick
(1994) Photomorphogenic mutants of higher plants. Pp. 601-628 In Photomorphogenesis in
Plants, R.E. Kendrick and G.H.M. Kronenberg, eds. Dordrecht, The Netherlands: Kluwer
Academic Pub.; Whitelam and Devlin (1997) Plant Cell Environ. 20: 752-758).
Phytochrome chromophore-deficient mutants, including hy1 and hy2 in Arabidopsis, yg-2
and aurea in tomato, pcd1 and pcd2 in pea, and pew1 and pew2 in Nicotiana
plumbaginifolia, have often been used as phytochrome-deficient mutants (reviewed by
Terry (1997) Plant Cell Environ. 20: 740-745). The aurea mutant of tomato has been used
widely for physiological studies of phytochrome, for the study of other photoreceptors, and
to study phytochrome signaling (Becker et al. (1992) Planta 188: 39-47; Bowler and Chua
(1994) Plant Cell 6: 1529-1541). Knowledge of the molecular basis of these mutations will
help in the interpretation of physiological experiments with these mutants. Biochemical
analyses have established that the hy1, pcd1, and yg-2 mutants are deficient at the step at
which BV is synthesized from heme, whereas pcd2 and aurea mutants are unable to
synthesize PΦB from BV (Terry and Kendrick (1996) J. Biol. Chem. 271: 21681-21686;
van Tuinen et al. (1996) Plant J. 9: 173-182; Weller et al. (1996) Plant Cell 8: 55-67;
Weller et al. (1997) Plant J. 11: 1177-1186). The cloning of HY1 has provided valuable
insight into the first committed enzyme of phytochrome chromophore biosynthesis, heme
oxygenase (Davis et al. (1999) Proc. Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et
al. (1999) Plant Cell 11: 335-347).

Of the five classic photomorphogenetic mutants, only hy2 remains to be
cloned. It is widely believed that HY2 encodes PΦB synthase. However, the observation
that a hy2 mutant is partially "rescued" by BV treatment suggests other possibilities (Parks
and Quail (1991) Plant Cell 3: 1177-1186). Although it is similar to that of hy1 mutants, the
chlorophyll-deficient phenotype of hy2 mutants is typically less severe (Koornneef et al.
(1980) Z. Pflanzenphysiol. 100: 147-160; Chory et al. (1989) Plant Cell 1: 867-880). The
identification of the HY2 gene in Arabidopsis should help to resolve these paradoxes. In this
study, we describe the molecular basis for the phytochrome-deficient phenotype in the hy2
mutant of Arabidopsis. We show that the HY2 gene encodes PΦB synthase, a
ferredoxin-dependent BV reductase that is responsible for the final step in phytochrome
chromophore biosynthesis in plastids. This work has enabled us to identify other members of
the HY2-related, ferredoxin-dependent bilin reductase family in phycobiliprotein-producing
photosynthetic organisms (see Example 3 herein).

Results. 

Fine Mapping Localizes the HY2 Gene to Two Overlapping Bacterial Artificial 
Chromosome Clones 

We used a positional cloning strategy to isolate the HY2 gene, which
previously had been mapped to chromosome 3. Because the hy2 long hypocotyl phenotype
is easy to score in seedlings, the HY2 locus has served as a useful landmark for classic
mapping. For fine mapping, we crossed the hy2-1 mutant of the Landsberg erecta (Ler)
ecotype to the wild-type Columbia ecotype, and segregating F2 populations with the hy2
phenotype were used for DNA preparation. First, we prepared DNA from ~400 plants to
perform genetic mapping of hy2 using cleaved amplified polymorphic sequence (CAPS)
markers (Konieczny and Ausubel (1993) Plant J. 4: 403-410) that we developed and that
are available in the database at the Arabidopsis Information Resource (TAIR;
http://www.arabidopsis.org/maps/CAPS_Cln-3.html). With ~400 plants, HY2 was mapped
to an interval of ~360 kb between positional markers cMLP3E-1 and cF3L24 (Figure 3A),
indicating that recombination frequency in this region was much lower than expected.
Therefore, we increased the size of the mapping population to ~2000 plants. This approach
enabled us to map the HY2 locus to an interval of ~66 kb between the markers cMZB10 and
cF3L24 (Figure 3A).

During these mapping studies, the sequences of two bacterial artificial
chromosome clones, MZB10 and F3L24, spanning the HY2 locus genetically defined above,
were deposited in the GenBank database (accession numbers AC009326 and AC011436).
There are at least 21 putative genes in the region between the closest recombination events.
We screened HY2 candidate genes based on the following expectations. First, HY2 should be
categorized as an unknown or putative gene, because neither gene nor protein sequences of
any ferredoxin-dependent bilin reductase were known. Second, HY2 should possess a
plastid transit peptide, because enzymatic activity for PΦB synthase was detected in plastids
(Terry and Lagarias (1991) J. Biol. Chem. 266: 22215-22221). Third, weak sequence
similarity between HY2 and unidentified open reading frames (ORFs) in fully
sequenced cyanobacterial genomes might be detectable, because HY2-related bilin
reductase activities have been reported in cyanobacteria (Cornejo and Beale (1997)
Photosynth. Res. 51: 223-230). The predicted amino acid sequences for all 21 genes in the
HY2 region were used for TBLASTN (Altschul et al. (1990) J. Mol. Biol. 215: 403-410)
and CHLOROP (Emanuelsson et al. (1999) Protein Sci. 8: 978-984;
http://www.cbs.dtu.dk/services/ChloroP/) analyses. By these criteria, one of these genes
with two distinct annotations, MZB10.18 (GenBank accession number AC009326-18) or
F3L24.1 (GenBank accession number AC011436-1), appeared to be a strong candidate for
HY2.

The HY2 Gene Is Identified by DNA Sequences of Wild-Type and Mutant 
Alleles 

To help identify the HY2 gene, RNA gel blot analysis of wild-type and hy2
mutant seedlings was performed using the cDNA for MZB10.18/F3L24.1 as a probe.
Because the hy2 phenotype is readily observed in seedlings, we analyzed the accumulation
of transcripts in Arabidopsis seedlings (Figure 4). Transcripts were detected in wild-type
seedlings of the three ecotypes tested. The slow migration of the mRNA of Col was verified
as a gel artifact (data not shown). RNA gel blotting showed that the transcript levels were
decreased severely in the hy2-1, hy2-106, and hy2-107 mutants and were decreased slightly
in other mutant lines. Consequently, we focused our attention on the MZB10.18/F3L24.1
gene. To determine if mutations were present in the MZB10.18/F3L24.1 gene in hy2 mutants,
DNA fragments corresponding to the region from the end of the upstream gene to the
beginning of the downstream gene from various hy2 alleles were amplified by polymerase
chain reaction (PCR). The nucleotide sequences were determined directly from the PCR
products. In all hy2 alleles tested, nucleotide substitutions or deletions were detected
(Figures 3A and 3B). Based on these data and biochemical data presented below, we
conclude that locus MZB10.18/F3L24.1 corresponds to the HY2 gene.

As a result of the conflict in annotation of the HY2 gene in MZB10.18 and
F3L24.1 (i.e., the former encodes a protein of 273 amino acids, and the latter encodes a
protein of 329 amino acids), we sought to verify experimentally the structure of the HY2
gene. To do so, seven cDNA clones prepared from Columbia seedling mRNA were isolated
from ~300,000 clones examined. The nucleotide sequences of independent cDNA clones
were determined, and they revealed a single reading frame that matched that of the
annotation for F3L24.1. The HY2 gene contains eight small exons ranging from 51 to 222
nucleotides separated by seven introns ranging from 74 to 183 nucleotides. The longest
cDNA insert contained a full-length 990-bp ORF, a 95-bp 5'-untranslated region, a 231-bp
3'-untranslated region, and a poly(A)+ stretch, as shown in Figure 5 (DNA Data Bank of
Japan [DDBJ] accession number AB045112).
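
The lengths reported for the longest cDNA can be cross-checked with simple arithmetic. The
following Python sketch is illustrative only; the function names are ours, and it assumes
that the 990-bp ORF figure includes the stop codon, which is consistent with the 329-residue
protein predicted by the F3L24.1 annotation.

```python
def orf_codons(orf_bp: int) -> int:
    """Amino acid codons in an ORF whose length includes one stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1


def cdna_length(utr5_bp: int, orf_bp: int, utr3_bp: int) -> int:
    """cDNA length excluding the poly(A) stretch."""
    return utr5_bp + orf_bp + utr3_bp


# Values taken from the text: 95-bp 5' UTR, 990-bp ORF, 231-bp 3' UTR.
print(orf_codons(990))             # number of encoded residues
print(cdna_length(95, 990, 231))   # length of the insert without poly(A)
```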

Figure 3A shows the genomic structure of the HY2 gene with positions of the
mutations in hy2 alleles. Two hy2 alleles, hy2-102 and hy2-107, were found to have point
mutations at 3' splice sites in the seventh and fifth introns, respectively. Such mutations in
the G of the essential AG dinucleotide at the 3' splice site have been reported to lead to
missplicing with a downstream AG, resulting in a frameshift in the protein (Brown (1996)
Plant J. 10: 771-780). The hy2-105 allele was another possible splicing mutant, with a 25-bp
deletion in the second intron. This mutation truncates the second intron to 57 nucleotides,
much smaller than the average size of Arabidopsis introns (240 nucleotides). The efficiency of
intron splicing might be reduced because of a minimum intron size requirement (Deutsch
and Long (1999) Nucleic Acids Res. 27: 3219-3228), although we have not checked the
significance of defects in pre-mRNA splicing experimentally. A fast neutron-generated
allele, hy2-106, carries a 5-bp deletion in the first exon, creating an immediate stop codon.
Four ethyl methanesulfonate-generated alleles, hy2-1, hy2-101, hy2-103, and hy2-104, have
single nucleotide changes that produce amino acid substitutions compared with the
corresponding wild-type allele. Two of these alleles, hy2-1 and hy2-104, have the same
mutation (P128L), whereas hy2-101 and hy2-103 possess G181R and R252Q substitutions,
respectively.
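
The consequence of losing the G of the 3' splice-site AG can be illustrated with a toy
model. The Python sketch below is a deliberate simplification: it assumes that the
spliceosome simply uses the next AG dinucleotide downstream, ignoring branch points and
polypyrimidine tracts, and it uses a made-up sequence rather than HY2 sequence. It shows
how use of a downstream AG removes extra nucleotides from the exon and, when that number is
not a multiple of three, shifts the reading frame.

```python
def exon_start_after_acceptor(seq: str, search_from: int) -> int:
    """Index of the first exon base, taken as the base after the first AG found."""
    i = seq.find("AG", search_from)
    if i < 0:
        raise ValueError("no AG acceptor found")
    return i + 2


# Hypothetical sequences (not from HY2): intron tail ending in AG, then exon.
INTRON_TAIL = "TTTTGCAG"
EXON = "CTTCCAGATGCC"
wild_type = INTRON_TAIL + EXON

ag_pos = len(INTRON_TAIL) - 2  # position of the A in the native AG
# Point mutation of the essential G to A (AG -> AA).
mutant = wild_type[: ag_pos + 1] + "A" + wild_type[ag_pos + 2:]

native_start = exon_start_after_acceptor(wild_type, ag_pos)
cryptic_start = exon_start_after_acceptor(mutant, ag_pos)
exon_bases_lost = cryptic_start - native_start
frameshift = exon_bases_lost % 3 != 0

print(exon_bases_lost, frameshift)
```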

The HY2 Protein Is Related to a Family of Cyanobacterial Proteins

Based on cDNA sequence analysis, the HY2 protein contains 329 residues
with a calculated molecular mass of 38.1 kD. At its N terminus, the HY2 protein sequence
is rich in serine, with few acidic residues (six serine and one aspartic acid among 45
residues), which suggests a possible transit peptide for localization to plastids (Gavel and
von Heijne (1990) FEBS Lett. 261: 455-458). The second amino acid after the initiation
methionine is alanine, which is often observed in plastid transit peptides. The program
CHLOROP was also used to predict the transit peptide of HY2, and it indicated that the first
45 amino acid residues of the HY2 protein form a chloroplast transit peptide (Emanuelsson
et al. (1999) Protein Sci. 8: 978-984; http://www.cbs.dtu.dk/services/ChloroP/). The
calculated molecular mass of the mature HY2 protein is 33.0 kD and its predicted pI is 5.66,
which are similar to those of PΦB synthase purified from oat seedlings. The HY2 protein
has no predicted transmembrane helices, which is also consistent with the observation that
oat PΦB synthase is a soluble protein.

Using the HY2 protein sequence as a query sequence, we performed an
iterative PSI-BLAST search of the nonredundant GenBank/EMBL database
(http://www.ncbi.nlm.nih.gov/blast/psiblast.cgi) using default search parameters (Altschul
et al. (1997) Nucleic Acids Res. 25: 3389-3402). Surprisingly, no HY2-related gene was
identified by this search in the nearly complete Arabidopsis genome. In contrast, this search
identified HY2-related sequences from two marine cyanobacteria, Prochlorococcus marinus
sp. SS120 (EMBL accession numbers CAB95700.1 and CAB95701.1) and Synechococcus
sp. WH8020 (SWISS-PROT accession numbers Q02189 and Q02190), and a related protein
sequence from the cyanobacterium Synechocystis sp. PCC 6803 (cyanobase locus slr0116;
Protein Information Resource accession number S76709). Both marine cyanobacteria
possess two HY2-related ORFs that appear to be part of multigene operons. Interestingly,
the Synechococcus ORFs, ycp2_synpy and ycp3_synpy, are located within a cluster of
genes involved in phycobiliprotein biosynthesis (Wilbanks and Glazer (1993) J. Biol. Chem.
268: 1226-1235), whereas the Prochlorococcus ORFs, which we term yhp2_proma and
yhp3_proma, are located immediately downstream of a gene related to heme oxygenase
(GB:AJ278499.1). These observations strongly support the hypothesis that these genes are
involved in phycobilin biosynthesis.



Figure 5 shows an optimized multiple sequence alignment of HY2 and HY2-related
cyanobacterial sequences using the programs CLUSTALW (Higgins et al. (1996)
Meth. Enzymol. 266: 383-402), MEME to guide hand alignments
(http://meme.sdsc.edu/meme/website/), and GENEDOC for highlighting
(http://www.psc.edu/biomed/genedoc). As expected, the HY2-related cyanobacterial
proteins lack the putative plastid transit peptide sequence found at the N terminus of HY2.
Pairwise sequence identities between HY2 and the cyanobacterial ORFs are quite low
(<20%), although the similarities between YCP2_SYNPY and YHP2_PROMA and
between YCP3_SYNPY and YHP3_PROMA suggest that these pairs of proteins have
similar functions. That the mutation in the hy2-1 and hy2-104 alleles (P128L) lies in a
conserved proline residue is consistent with a critical role of this residue in the enzyme's
structure. Proline residues are typically involved in cis-peptide bonds, which occur at
β-turns in proteins. Examination of the amino acid alterations in the two other missense
alleles, G181R in hy2-101 and R252Q in hy2-103, reveals that neither mutation corresponds
to a strongly conserved residue in this protein family.

The HY2 Protein Is Localized to the Plastid

The N terminus of HY2 has a stretch of 45 amino acids with features of a
chloroplast transit peptide. To determine whether this peptide is a functional
plastid-targeting sequence, we fused the transit peptide coding region of HY2 to a modified
gene of green fluorescent protein (GFP) from jellyfish under the control of a modified
cauliflower mosaic virus 35S promoter (Chiu et al. (1996) Curr. Biol. 6: 325-330). The
construct was introduced into onion skin cells and tobacco leaves by bombardment with
DNA-coated particles, and transient expression was analyzed using confocal laser scanning
microscopy. Although a control construct without the putative transit peptide showed GFP
fluorescence throughout the cytoplasm and the nucleus of onion cells (Figure 6A), clear
localization of GFP fluorescence to small dots, most likely plastids, was observed when the
putative transit peptide was fused to GFP (Figure 6B). For better visualization, we also
introduced the construct into tobacco leaves, where the chloroplasts are well developed in
guard cells. GFP fluorescence was localized exclusively in oval structures (Figure 6C) that
match the red autofluorescence from the chlorophyll of the chloroplasts (Figure 6D),
demonstrating that the fusion protein is efficiently targeted to chloroplasts. This finding
confirms the presence of a functional transit peptide and implies that the HY2 gene product
is localized in the chloroplast.

Recombinant HY2 Exhibits PΦB Synthase Activity

The HY2 protein lacking the transit peptide, mHY2, was synthesized in
Escherichia coli as a fusion protein with glutathione-S-transferase (GST) and purified by
affinity chromatography, as described in Methods. The GST tag was removed by site-specific
protease digestion. A second round of affinity chromatography yielded protein at >90%
homogeneity. Figure 7 shows SDS-PAGE results of the purification and processing of the
protein. One liter of bacterial culture yielded approximately 1 mg of recombinant protein.
The molecular mass of the Arabidopsis mHY2 deduced from the cDNA is 33 kD.
However, the cloning and expression strategy for the mHY2 cDNA using pGEX-6P-1 was
responsible for an additional five N-terminal amino acids (GPLGS) after protease treatment.

To determine whether mHY2 has PΦB synthase activity, its ability to reduce
BV to PΦB was first assessed with a "coupled" holophytochrome assembly assay in which
the reaction products were incubated with recombinant cyanobacterial phytochrome 1
(Cph1) apoprotein (Yeh et al. (1997) Science 277: 1505-1508).

Figure 8 shows a phytochrome difference spectrum obtained after incubation
of apoCph1 with the bilin products from a PΦB synthase assay of a crude cell-free bacterial
extract expressing GST-mHY2. The difference spectrum has a peak at 676 nm and a valley
at 724 nm, which is consistent with a PΦB-Cph1 adduct (Yeh et al. (1997) Science 277:
1505-1508). To ensure that this activity was not due to a component of the crude
E. coli lysate, the ability of purified mHY2 to reduce BV to PΦB was analyzed using
the coupled assembly assay as well as an HPLC assay. A phytochrome difference spectrum
identical to that shown in Figure 8 was obtained (data not shown). The HPLC results of the
PΦB synthase assay mixture are shown in Figure 9. After incubation of HY2 for 30 min
under standard PΦB synthase assay conditions, all of the BV was converted to PΦB.
Interestingly, both 3Z- and 3E-PΦB isomers were produced, although the relative amount of
the 3E-PΦB isomer varied for different HY2 samples and may be an artifact of the presence
of residual glutathione.

Discussion. 

The hy2 mutant of Arabidopsis is one of five classic long hypocotyl mutants
first identified by Koornneef et al. (1980) Z. Pflanzenphysiol. 100: 147-160. That the hy2
mutant is photomorphogenetically impaired due to a phytochrome deficiency has been well
documented by physiological and photobiological analyses (Koornneef et al. (1980) Z.
Pflanzenphysiol. 100: 147-160; Chory et al. (1989) Plant Cell 1: 867-880; Parks and Quail
(1991) Plant Cell 3: 1177-1186; Goto et al. (1993) Photochem. Photobiol. 57: 867-871).
Parks and Quail (1991) Plant Cell 3: 1177-1186, showed that the long hypocotyl phenotype
of the hy1 and hy2 mutants was in part "rescued" by BV feeding and suggested that these
mutants have lesions in the phytochrome chromophore biosynthetic pathway. Indeed, HY1
encodes a plastid-localized heme oxygenase that catalyzes the cleavage of heme to form BV
(Davis et al. (1999) Proc. Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et al. (1999)
Plant Cell 11: 335-347). This example establishes that HY2 encodes PΦB synthase, a
plastid-localized enzyme responsible for the ferredoxin-dependent conversion of BV to
PΦB, the immediate precursor of the phytochrome chromophore. Although
complementation experiments are in progress, sequence analysis of eight mutant alleles has
revealed molecular lesions within the HY2 gene. Many of the hy2 alleles also display
altered expression of the HY2 transcript, providing compelling evidence that the reduced
expression of this gene is responsible for the long hypocotyl phenotype.

Based on the presence of a functional plastid-targeting sequence in the HY2
protein, we can confidently conclude that the entire pathway of PΦB biosynthesis occurs
within plastids. Nevertheless, the possibility of an alternative pathway in other subcellular
compartments cannot be dismissed entirely. In this regard, there are three other heme
oxygenase genes besides HY1 in the Arabidopsis genome whose products may play a role in
an alternative pathway (M. Masuda, T. Muramoto, and T. Kohchi, unpublished data).
However, our database searches revealed no other gene in the Arabidopsis genome that
shows statistically significant similarity to HY2. Although a weak similarity between HY2
and a ferredoxin-dependent bilin reductase involved in chlorophyll catabolism, red
chlorophyll catabolite reductase, was revealed by profile analysis, red chlorophyll catabolite
reductase does not catalyze the reduction of BV to PΦB (Wüthrich et al. (2000) Plant J. 21:
189-198). Therefore, it appears that HY2 is the only PΦB synthase gene in Arabidopsis.

Physiological comparisons of the hy1 and hy2 mutants indicate that hy1
plants display more severe phytochrome-deficient phenotypes (Koornneef et al. (1980)
Z. Pflanzenphysiol. 100: 147-160; Chory et al. (1989) Plant Cell 1: 867-880). These
observations are somewhat surprising in view of the apparent uniqueness of the HY2 gene
and the existence of multiple HY1-related proteins in the Arabidopsis genome. However,
this may reflect the strength of the hy1 and hy2 alleles examined. In this regard, the partial
rescue of the hy2-1 mutant treated with BV (Parks and Quail (1991) Plant Cell 3:
1177-1186) can be explained by the hypothesis that the P128L missense mutation affords a
partially active enzyme with a lower affinity for BV. Alternatively, it is possible that BV
might be converted to PΦB by an enzyme unrelated to HY2 in Arabidopsis. Phytochrome
chromophore biosynthetic mutants have been identified in other plant species (Terry (1997)
Plant Cell Environ. 20: 740-745). In all cases, two classes of mutants have been identified:
those that are deficient in heme oxygenase and those that are deficient in PΦB synthase.
Based on biochemical analyses, the aurea mutant of tomato and the pcd2 mutant of pea are
deficient in PΦB synthase activity (van Tuinen et al. (1996) Plant J. 9: 173-182; Weller et
al. (1997) Plant J. 11: 1177-1186). The observations that the corresponding heme
oxygenase mutants in these plant species (i.e., yg-2 and pcd1, respectively) exhibit less
severe phenotypes further support the hypothesis that the relative allele strength of the two
loci determines the phenotype. A phenotypic comparison of null alleles of hy1 and hy2
(e.g., hy1-106 and hy2-107) should help resolve this question.

The cloning of the Arabidopsis HY2 gene will help to identify PΦB synthase
genes from other plant species and to confirm that the mutations in aurea and pcd2 occur in
homologous genes. The aurea mutant of tomato has been used extensively to analyze
phytochrome signal transduction (Bowler et al. (1994) Cell 77: 73-81), and knowledge of
the molecular basis of this mutation is of considerable interest. The molecular basis of such
mutations should provide insight into residues critical for substrate and/or potential cofactor
(i.e., metal ions or organic single electron carriers) interactions as well as those necessary
for protein-protein interactions (i.e., between HY2 and ferredoxin or between HY1 and
HY2). The availability of HY1- and HY2-specific cDNA probes and specific antibodies to
both enzymes will facilitate experiments to study the regulation of phytochrome
chromophore biosynthesis. With such probes, several key questions can be addressed. Are
the two enzymes expressed coordinately in all tissues? Is their expression spatially and
temporally regulated? Do HY1 and HY2 proteins form a dual enzyme complex in the
plastid that channels the conversion of heme to PΦB? Does the expression of HY1 affect
HY2 expression and vice versa?

The molecular cloning of HY2 has provided a breakthrough in our
knowledge of bilin biosynthesis in general. Our bioinformatic analyses reveal that HY2 is
related to a number of cyanobacterial genes of unknown function (Figure 5). Indeed, we
believe these HY2-related proteins are enzymes involved in the biosynthesis of
phycocyanobilin and phycoerythrobilin, the chromophore precursors of the
light-harvesting phycobiliproteins. As might be expected for enzymes with different
substrate/product specificities, these proteins are highly diverged from HY2 (<20% sequence
identity). The residues that are identical between these proteins and HY2, which are
highlighted in Figure 5, likely reflect residues involved in overall protein folding and/or
ferredoxin interaction that are common to the entire family of enzymes. In Example 2, we
demonstrate that these HY2-related proteins are members of a growing family of
ferredoxin-dependent bilin reductases with different double bond specificities.

The pathway for phytochrome chromophore biosynthesis shown in Figure 1
has been clearly documented. Now that the two key genes of the phytochrome
chromophore biosynthetic pathway have been cloned, we can elucidate how bilin
biosynthesis is regulated throughout the plant, a process that is critical to the plant's ability
to respond to light. The possible role of bilins as second messengers, which was raised by
recent studies of transgenic plants expressing mammalian biliverdin reductase (Montgomery
et al. (1999) Plant Physiol. 121: 629-639), can be addressed by manipulating the expression
of HY1 and HY2 genes within different cells and tissues of the plant. Finally, it will be of
particular interest to address the relationship of phytochrome chromophore biosynthesis and
chlorophyll biosynthesis, not only because they share common biosynthetic intermediates
but to determine how each pathway influences the other.

Methods. 

Plant Materials 

Arabidopsis thaliana ecotypes Columbia (Col), Landsberg erecta (Ler), and
Wassilewskija (Ws) were obtained from our laboratory stocks. Mutant strains used in this
work were obtained from Maarten Koornneef for hy2-1 (distributed as CS68 by the
Arabidopsis Biological Stock Center, Columbus, OH; in Ler ecotype); from Jason Reed for
hy2-101 (EMS89S738-E isolated originally by J. Reed; in Col ecotype), hy2-102 (EMS195
isolated by J. Reed; in Col ecotype), hy2-103 (IAA R -7 isolated by Allison Wilson; in Col
ecotype), hy2-104 (IAA R -12 isolated by A. Wilson; in Col ecotype), hy2-105 (ylO-9 isolated
by J. Reed; in Col ecotype), and hy2-106 (FN16-3 isolated by Aron Silverstone; in Ler
ecotype); and from Nam-Hai Chua for hy2-107 (segregated hy2 from T-DNA lines in his
laboratory; in Ws ecotype). Plants were grown under long-day conditions at 22°C in a
growth chamber.

Map-Based Cloning 

The hy2-1 mutant was outcrossed with wild-type Col ecotype, and the
mapping population was selected from F2 families with the long hypocotyl phenotype.
Genomic DNA was prepared using a protocol described by Edwards et al. (1991) Nucleic
Acids Res. 19: 1349. We used cleaved amplified polymorphic sequence (CAPS) markers
between Col and Ler (Konieczny and Ausubel (1993) Plant J. 4: 403-410), two CAPS
markers (C6 and manganese-superoxide dismutase) in the Arabidopsis database, and seven
new CAPS markers developed during this study. Primer sequences for polymerase chain
reaction (PCR) amplification are listed here with the enzymes used for digestion indicated
in parentheses: c4523, 5'-ACA GCG AGA TTC AAA GGT CCA TTA ACC GGA-3' (SEQ
ID NO:1) and 5'-GGG CTT ACA GTG ATA TCT GCA AGA CTT CTA-3' (HpaII) (SEQ
ID NO:2); cMLP3E1, 5'-TAA TGC TTG CGA CAA ACA GG-3' (SEQ ID NO:3) and
5'-GTT CAT CTC AGG GCC AAA AA-3' (RsaI) (SEQ ID NO:4); cMXK7, 5'-GCT TTC
AGA AAT CAG ACC TCA A-3' (SEQ ID NO:5) and 5'-CTG GTG TGG TTG ATC GAA
TCT-3' (DdeI) (SEQ ID NO:6); cMZB10, 5'-CTG CCA AGC TTC ATT TGG TT-3' (SEQ
ID NO:7) and 5'-GCA GGA GCT GCA GAC AAT CT-3' (BsrI) (SEQ ID NO:8);
cMZB10.18 (=HY2), 5'-CAA TGC AGG TTT AAC TTC AGC A-3' (SEQ ID NO:9) and
5'-CCA TGG GAA AGT CTG CAA AT-3' (DdeI) (SEQ ID NO:10); cF3L24, 5'-TCA AGC
CCT TTT CCA ACA TC-3' (SEQ ID NO:11) and 5'-TTC CCC ATC TGA ACT CAA
CC-3' (HinfI) (SEQ ID NO:12); and cF8A24, 5'-AAT GAT GCA TGG TGT TGG TG-3' (SEQ
ID NO:13) and 5'-GCT CGA GGA AAA GTC ATC CA-3' (MboI) (SEQ ID NO:14).
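
The logic of a CAPS marker (amplify a region by PCR, digest with a restriction enzyme, and
score the fragment pattern) can be sketched in a few lines of Python. The example below is
illustrative only: the two amplicons are hypothetical toy sequences, not Col or Ler sequence,
and only enzymes with palindromic recognition sites are included so that scanning the top
strand is sufficient. The recognition sequences and cut positions are the standard ones for
these enzymes.

```python
import re

# Recognition sequence and top-strand cut offset (N = any base).
ENZYMES = {
    "HpaII": ("CCGG", 1),
    "RsaI": ("GTAC", 2),
    "DdeI": ("CTNAG", 1),
    "MboI": ("GATC", 0),
}


def digest(amplicon: str, enzyme: str) -> list:
    """Return fragment lengths after complete digestion of a linear amplicon."""
    site, offset = ENZYMES[enzyme]
    # A lookahead pattern lets overlapping sites be found as well.
    pattern = re.compile("(?=" + site.replace("N", "[ACGT]") + ")")
    cuts = [m.start() + offset for m in pattern.finditer(amplicon.upper())]
    bounds = [0] + cuts + [len(amplicon)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


# Hypothetical alleles differing by a single base inside an RsaI site.
COL_ALLELE = "AAAAAGTACTTTTTTTTTT"
LER_ALLELE = "AAAAAGTGCTTTTTTTTTT"

print(digest(COL_ALLELE, "RsaI"))  # cut allele: two fragments
print(digest(LER_ALLELE, "RsaI"))  # uncut allele: one fragment
```

A plant scored as uncut for one allele and cut for the other at a given marker would be
heterozygous at that position, which is how recombinants between flanking markers are
recognized in a mapping population.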

Sequence Analysis of the HY2 Locus 

A pair of primers (5'-CGT TTG TCT CAC TGA AAC TG-3' (SEQ ID
NO:15) and 5'-CAA TCA TCT TGA AAT GCA GA-3' (SEQ ID NO:16)) was used to
amplify 1.98-kb fragments of the MZB10.18 region from mutants and their corresponding
wild-type plants. The PCR products were subjected directly to a cycle-sequencing protocol
with several primers, and reactions were analyzed on an ABI 373S sequencing apparatus
(Applied Biosystems, Foster City, CA).

Isolation of HY2 cDNA 

A cDNA library was constructed by K. Aiido (Nara Institute of Science and
Technology) from Col seedlings in λZAPII (Stratagene) according to the manufacturer's
instructions. The DNA fragment containing MZB10.18 described above was used as a
probe to screen ~300,000 cDNA clones by plaque hybridization. Several cDNA plasmids
were recovered by in vivo excision according to the manufacturer's instructions.

RNA Isolation and Analysis

RNA was isolated from 1-week-old whole Arabidopsis seedlings by the acid 
guanidinium thiocyanate-phenol-chloroform extraction method using Isogen (Nippon Gene, 
Tokyo, Japan). Total RNA (10 µg/lane) was electrophoresed on a 1.2% 
formaldehyde/agarose gel and transferred to a nylon membrane (Hybond-N; Amersham 
Corp.). Prehybridization and hybridization were then performed in Church hybridization 
solution (Church and Gilbert (1984) Proc. Natl. Acad. Sci. USA 81: 1991-1995) using 
radioactive probes (3 x 10⁶ to 5 x 10⁶ cpm/mL). A fragment of cDNA produced by EcoRI 
and XhoI digestion was used as a hybridization probe. Filters were washed under highly 
stringent conditions three times with 1x SSC (1x SSC is 0.15 M NaCl and 0.015 M sodium 
citrate), 0.1% SDS at room temperature and twice with 0.2x SSC, 0.1% SDS at 65°C for 15 
min. To show equal loading of RNA, an rRNA probe was used for hybridization. 
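
The salt concentrations of the two wash buffers follow directly from the definition of 1x SSC given above. The short sketch below works this out; the function name `ssc_molarity` is an assumption made for illustration.

```python
# Composition of 1x SSC as defined in the text (molar concentrations).
SSC_1X_MOLAR = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Scale the 1x SSC composition to an arbitrary fold concentration."""
    if fold <= 0:
        raise ValueError("fold concentration must be positive")
    return {salt: molar * fold for salt, molar in SSC_1X_MOLAR.items()}


if __name__ == "__main__":
    for fold in (1.0, 0.2):  # the two wash stringencies used above
        print(f"{fold}x SSC:", ssc_molarity(fold))
```

For the 0.2x wash this corresponds to about 0.03 M NaCl and 0.003 M sodium citrate.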

Subcellular Localization Experiment with Green Fluorescent Protein Fusion 

The coding region of HY2 for the putative transit peptide and flanking amino 
acid residues (amino acids 1 to 62) isolated by PCR was cloned into pTH2XA, a modified 
green fluorescent protein (GFP) vector derived from 35SΩ-sGFP-S65T (Chiu et al. (1996) 
Curr. Biol. 6: 325-330). In pTH2XA, five glycine residues were included at the fusion 
junction to GFP (M. Takemura, unpublished data). The construct, which can express the 
HY2 transit peptide fused to the N terminus of a modified GFP gene under the control of the 
cauliflower mosaic virus 35S promoter, was introduced into onion bulbs and tobacco leaves. 
The conditions of bombardment were the same as those described by Muramoto et al. 
(1999) Plant Cell 11: 335-347. Transient expression was observed after overnight 
incubation using confocal laser scanning microscopy (LSM510; Carl Zeiss, Jena, Germany). 

Construction of the pGEX-mHY2 Expression Vector 

mHY2, the mature HY2 gene without the predicted chloroplast transit 
peptide, was subcloned into the Escherichia coli expression vector pGEX-6P-1 (Amersham 
Pharmacia Biotech, Piscataway, NJ) to produce pGEX-mHY2. mHY2 was amplified using 
the primers mHY2BglIIfwd (5'-GAAGATCTG TCT CTG CTG TGT CGT ATA AGG-3', 
SEQ ID NO:17) and HY2SmaIrev (5'-TCC CCCGGG TTA GCC GAT AAA TTG TCC 
TGT TAA ATC-3', SEQ ID NO:18), which contained BglII and SmaI sites (underlined), 
respectively, and was cloned into BamHI-SmaI-digested pGEX-6P-1 to give pGEX-mHY2. 
The integrity of the construct was verified by restriction analysis and complete DNA 
sequencing of the insert (Davis Sequencing, Inc., Davis, CA). The constructed vector 
contains the mHY2 sequence placed 3' to the glutathione-S-transferase (GST) gene of 
Schistosoma japonicum under the control of a Ptac promoter. A recognition sequence for 
PreScission protease, which is also a GST fusion protein, is located upstream of mHY2. 
Proteolytic cleavage yields the native Arabidopsis mHY2 with the five-amino acid N- 
terminal extension GPLGS. 

Expression and Purification of Recombinant mHY2 

The Escherichia coli strain DH5α containing pGEX-mHY2 was grown at 37°C 
in 500-mL batches of Luria-Bertani medium containing ampicillin (100 µg/mL) to an OD578 
of 0.6. Cultures were induced by the addition of 1 mM isopropylthio-β-galactoside and 
incubated for an additional 3 hr, and bacteria were harvested subsequently by 
centrifugation. The bacterial pellet from 3 liters of culture was resuspended in 20 mL of 
lysis buffer (50 mM Tris-HCl, pH 8.0, 100 mM NaCl, 0.05% Triton X-100, 1 mM DTT, 2 
mM benzamidine, 2 mM PMSF, leupeptin [2.0 µg/mL], and pepstatin A [3 µg/mL]) and 
disrupted with a French press (3 x 20,000 p.s.i.). Cell debris was removed by 
centrifugation for 30 min at 100,000g. The resulting supernatant was loaded directly onto a 
glutathione-agarose (Sigma) column (1 cm x 3 cm) that had been equilibrated with 5 
column volumes of PBS. Unbound protein was removed by washing the column with 5 
column volumes of PBS. GST-mHY2 fusion protein was eluted with 50 mM Tris-HCl, pH 
8.0, containing 10 mM reduced glutathione. GST-mHY2-containing fractions were pooled 
and dialyzed overnight against cleavage buffer (50 mM Tris-HCl, pH 7.0, 100 mM NaCl, 1 
mM EDTA, and 1 mM DTT). Digestion of the fusion protein was performed by adding 2 
units of PreScission protease (Amersham Pharmacia Biotech) per 100 µg of fusion protein 
and incubating at 4°C for 5 h. Removal of uncleaved fusion protein and excised GST tag 
was achieved by loading the digestion mixture onto a second glutathione-agarose column (1 
cm x 3 cm). Recombinant mHY2 was detected in the flow through, analyzed by SDS- 
PAGE, and concentrated using Centriprep-10 concentrator devices (Amicon, Beverly, MA). 
One liter of bacterial culture yielded approximately 1 mg of purified protein. 

Determination of Protein Concentration 

Protein concentration was determined using the method of Bradford (1976) 
Anal. Biochem. 72: 248-254, or by absorption at 280 nm for purified mHY2, where 1 
absorption unit represents 0.64 mg/mL mHY2 (Gill and von Hippel (1989) Anal. Biochem. 
182: 319-326). 
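
The absorbance-based estimate above is a simple proportionality, sketched below. The conversion factor (0.64 mg/mL per absorption unit at 280 nm) is taken from the text; the function name and the optional dilution factor are assumptions added for illustration.

```python
# Conversion factor stated in the text: 1 absorption unit at 280 nm
# corresponds to 0.64 mg/mL purified mHY2.
MG_PER_ML_PER_A280 = 0.64


def mhy2_concentration_mg_per_ml(a280: float, dilution_factor: float = 1.0) -> float:
    """Estimate mHY2 concentration from an A280 reading.

    dilution_factor is the fold dilution applied before measurement
    (1.0 means the sample was read undiluted); it is an illustrative
    convenience, not something specified in the text.
    """
    if a280 < 0 or dilution_factor <= 0:
        raise ValueError("absorbance must be >= 0 and dilution factor > 0")
    return a280 * dilution_factor * MG_PER_ML_PER_A280


if __name__ == "__main__":
    # A sample diluted 10-fold that reads 0.25 at 280 nm.
    print(mhy2_concentration_mg_per_ml(0.25, dilution_factor=10.0), "mg/mL")
```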

PΦB Synthase Activity Assay 

All enzymes used for the PΦB synthase assay were obtained from Sigma. For a 
1-mL assay of PΦB synthase, the protein fraction to be assayed was diluted into 50 mM 
Tes-KOH, pH 7.3, containing an NADPH-regenerating system (6.5 mM glucose-6- 
phosphate, 0.82 mM NADP+, 1.1 units/mL glucose-6-phosphate dehydrogenase type XII 
from Torula yeast [EC 1.1.1.49]), a ferredoxin-reducing system (4.6 µM spinach ferredoxin, 
0.025 units/mL spinach ferredoxin:NADP+ oxidoreductase [EC 1.18.1.2]), and 10 µM BSA 
(fraction V, heat shock). Glucose-6-phosphate and NADP+ were prepared as 100- and 25- 
mM stocks, respectively, in water; both were stored at 4°C. The glucose-6-phosphate stock 
was filter sterilized before storage. Glucose-6-phosphate dehydrogenase was prepared as a 
500-unit/mL stock in 5 mM sodium citrate, pH 7.4, and stored at 4°C. Spinach 
ferredoxin:NADP+ oxidoreductase was prepared as a 10-unit/mL stock with sterile water 
and stored at 4°C. BSA was made up as a 100-µM stock solution in 0.1 M potassium 
phosphate buffer, pH 7.4, and stored at either 4 or -20°C. The reaction was initiated by the 
addition of 5 µM (final concentration) purified biliverdin IXα (McDonagh and Palma 
(1980) Biochem. J. 189: 193-208) in 5 µL of DMSO. Assay mixtures were incubated in a 
28°C water bath under green safe light or under subdued light for the desired amount of 
time. The assays were stopped by placing them on ice. Product analysis used a direct 
HPLC assay or a coupled assay after the addition of recombinant cyanobacterial 
apophytochrome 1 (Cph1) and difference spectroscopy (see below). 
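
For the components whose stock and final concentrations are both stated above, the stock volume needed per assay follows from C1·V1 = C2·V2. The sketch below is illustrative only: the table layout and function names are assumptions, and ferredoxin is omitted because its stock concentration is not given in the text.

```python
# (stock concentration, final concentration, unit) as stated in the text.
ASSAY_COMPONENTS = {
    "glucose-6-phosphate": (100.0, 6.5, "mM"),
    "NADP+": (25.0, 0.82, "mM"),
    "glucose-6-phosphate dehydrogenase": (500.0, 1.1, "units/mL"),
    "ferredoxin:NADP+ oxidoreductase": (10.0, 0.025, "units/mL"),
    "BSA": (100.0, 10.0, "µM"),
}


def stock_volume_ul(stock: float, final: float, assay_volume_ml: float = 1.0) -> float:
    """Volume of stock (in µL) giving the final concentration, from C1*V1 = C2*V2."""
    if stock <= 0 or final < 0 or final > stock:
        raise ValueError("need 0 <= final <= stock and stock > 0")
    return final / stock * assay_volume_ml * 1000.0


def pipetting_plan(assay_volume_ml: float = 1.0) -> dict:
    """Stock volume in µL for every tabulated component."""
    return {
        name: stock_volume_ul(stock, final, assay_volume_ml)
        for name, (stock, final, _unit) in ASSAY_COMPONENTS.items()
    }


if __name__ == "__main__":
    for name, volume in pipetting_plan().items():
        print(f"{name}: {volume:.1f} µL per 1-mL assay")
```

The remaining volume would be made up by the protein fraction and the Tes-KOH buffer.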

Direct HPLC Assay 

For the quantitative analysis of PΦB synthase activity, assay mixtures 
(outlined above) were loaded onto a Waters (Milford, MA) C18 Sep-Pak Light (catalog No. 
WAT023501) preconditioned as follows: 3-mL wash with acetonitrile to wet the Sep-Pak, 
3-mL wash with MilliQ water, and 3-mL wash with 50 mM 4-methylmorpholine/glacial 
acetic acid (pH 7.7). After the sample was loaded onto the Sep-Pak, it was washed with 3 
mL of 4-methylmorpholine/glacial acetic acid (pH 7.7) followed by 3 mL of 0.1% (v/v) 
trifluoroacetic acid. The Sep-Pak was then eluted with 2 mL of 100% acetonitrile. The 
eluate was dried using a Speed-Vac lyophilizer. The dried samples were analyzed by 
HPLC. Samples were first dissolved in 10 µL of DMSO and then diluted with 200 µL of 
the HPLC mobile phase (acetone:20 mM formic acid [50:50, v/v]). After the samples were 
dissolved, they were centrifuged briefly, passed through a 0.45-µm polytetrafluoroethylene 
syringe filter, and chromatographed using a Varian (Palo Alto, CA) 5000 liquid 
chromatograph. The column eluate was monitored at 380 nm using a Varian UV100 flow- 
through absorbance detector. Peak areas were quantitated using a 3365 Chemstation II 
(Hewlett-Packard, Waldbronn, Germany). The HPLC column used for all of the analyses 
was a Phenomenex (Torrance, CA) Ultracarb 5-µm ODS (20) 4.6-mm x 250-mm analytical 
column with a 4.6-mm x 30-mm guard column of the same material. The mobile phase 
used with this column was acetone:20 mM formic acid (50:50, v/v). The flow rate was 0.8 
mL/min. 

Coupled Difference Spectral Assay 

An alternative to the direct analysis of PΦB synthase activity was the 
coupled, or indirect, assay. This assay was based on the method outlined previously (Terry 
and Lagarias (1991) J. Biol. Chem. 266: 22215-22221). The assay described above for 
PΦB synthase was performed as before, but instead of working up the sample by Sep-Pak, 
an aliquot of recombinant apophytochrome (Cph1 from Synechocystis sp. PCC 6803) was 
added to the sample. The sample was incubated for an additional 20 to 30 min at room 
temperature under green safe light, and then a difference spectrum was taken. The method 
for difference spectroscopy was described previously (Terry and Lagarias (1991) J. Biol. 
Chem. 266: 22215-22221). 

Example 2 

Functional Genomic Analysis of the HY2 Family of Ferredoxin-Dependent Bilin 
Reductases from Oxygenic Photosynthetic Organisms 

Phytobilins are linear tetrapyrrole precursors of the light-harvesting 
prosthetic groups of the phytochrome photoreceptors of plants and the phycobiliprotein 
photosynthetic antennae of cyanobacteria, red algae, and cryptomonads. Previous 
biochemical studies have established that phytobilins are synthesized from heme via the 
intermediacy of biliverdin IXα (BV), which is reduced subsequently by ferredoxin- 
dependent bilin reductases with different double-bond specificities. By exploiting the 
sequence of phytochromobilin synthase (HY2) of Arabidopsis, an enzyme that catalyzes the 
ferredoxin-dependent conversion of BV to the phytochrome chromophore precursor 
phytochromobilin, genes encoding putative bilin reductases were identified in the genomes 
of various cyanobacteria, oxyphotobacteria, and plants. Phylogenetic analyses resolved four 
classes of HY2-related genes, one of which encodes red chlorophyll catabolite reductases, 
which are bilin reductases involved in chlorophyll catabolism in plants. To test the catalytic 
activities of these putative enzymes, representative HY2-related genes from each class were 
amplified by the polymerase chain reaction and expressed in Escherichia coli. Using a 
coupled apophytochrome assembly assay and HPLC analysis, we examined the ability of 
the recombinant proteins to catalyze the ferredoxin-dependent reduction of BV to 
phytobilins. These investigations defined three new classes of bilin reductases with distinct 
substrate/product specificities that are involved in the biosynthesis of the phycobiliprotein 
chromophore precursors phycoerythrobilin and phycocyanobilin. Implications of these 
results are discussed with regard to the pathways of phytobilin biosynthesis and their 
evolution. 

Introduction. 

Phytobilins are linear tetrapyrrole molecules synthesized by plants, algae, 
and cyanobacteria that function as the direct precursors of the chromophores of the light- 
harvesting phycobiliproteins and of the photoreceptor phytochrome (Beale (1993) Chem. 
Rev. 93: 785-802; Hughes and Lamparter (1999) Plant Physiol. 121: 1059-1068). The 
pathways of phytobilin biosynthesis have been elucidated by biochemical fractionation of 
plant and algal extracts, by overcoming a blocked step with exogenous putative 
intermediates, and by analysis of linear tetrapyrrole-deficient mutants (Beale and Cornejo 
(1991) J. Biol. Chem. 266: 22328-22332; Beale and Cornejo (1991) J. Biol. Chem. 266: 
22333-22340; Beale and Cornejo (1991) J. Biol. Chem. 266: 22341-22345; Terry et al. 
(1993) Arch. Biochem. Biophys. 306: 1-15). These studies indicate that the biosynthesis of 
phytobilins shares common intermediates with heme and chlorophyll biosynthetic pathways 
to the level of protoporphyrin IX, at which point the latter two pathways diverge by 
metalation with iron or magnesium (Beale (1993) Chem. Rev. 93: 785-802). 

Phytobilins are derived from heme, which is converted to biliverdin IXα (BV), 
the first committed intermediate in their biosynthesis. In red algae, cyanobacteria, and 
plants, this interconversion is accomplished by ferredoxin-dependent heme oxygenases that 
are related in sequence to the mammalian heme oxygenase (Cornejo et al. (1998) Plant J. 
15: 99-107; Davis et al. (1999) Proc. Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et 
al. (1999) Plant Cell 11: 335-347). Although they catalyze the same reaction, mammalian 
heme oxygenases use an NADPH-dependent cytochrome P450 reductase to generate 
reducing power for heme catabolism (Maines (1988) FASEB J. 2: 2557-2568). 
The metabolic fate of BV differs in mammals, cyanobacteria, and plants, 
with BV being metabolized by different reductases with unique double-bond specificities 
(Fig 1). Mammalian biliverdin IXα reductase (BVR), an NAD(P)H-dependent enzyme that 
catalyzes the two-electron reduction of BV at the C10 methine bridge to produce bilirubin 
IXα (BR), was the first of these enzymes to be discovered (Maines and Trakshel (1993) Arch. 
Biochem. Biophys. 300: 320-326). A similar enzyme, encoded by the gene bvdR, was 
identified recently in cyanobacteria (Schluchter and Glazer (1997) J. Biol. Chem. 272: 
13562-13569). Cyanobacteria and red algae also possess novel ferredoxin-dependent bilin 
reductases for the synthesis of the linear tetrapyrrole precursors of their phycobiliprotein 
light-harvesting antennae complexes (Beale and Cornejo (1991) J. Biol. Chem. 266: 22328- 
22332; Beale and Cornejo (1991) J. Biol. Chem. 266: 22333-22340; Beale and Cornejo 
(1991) J. Biol. Chem. 266: 22341-22345; Cornejo et al. (1998) Plant J. 15: 99-107). 

Primarily on the basis of studies with the red alga Cyanidium caldarium, 
these investigators proposed that the biosynthesis of the two major phycobiliprotein 
chromophore precursors, phycoerythrobilin (PEB) and phycocyanobilin (PCB), requires 
two ferredoxin-dependent bilin reductases and several double-bond isomerases. The first 
bilin reductase catalyzes the two-electron reduction of BV at the C15 methine bridge to 
produce the BR isomer 15,16-dihydrobiliverdin (DHBV), whereas the second bilin 
reductase catalyzes the conversion of 15,16-DHBV to 3Z-PEB, a formal two-electron 
reduction of the C2 and C3¹ diene system. In C. caldarium, an additional enzyme mediates 
the isomerization of 3Z-PEB to 3Z-PCB, both of which appear to be isomerized to their 
corresponding 3E isomers before assembly with the nascent phycobiliprotein apoproteins 
(Beale and Cornejo (1991) J. Biol. Chem. 266: 22328-22332; Beale and Cornejo (1991) J. 
Biol. Chem. 266: 22333-22340; Beale and Cornejo (1991) J. Biol. Chem. 266: 22341- 
22345). 

More recent studies lend support for a similar pathway of PCB and PEB 
synthesis in cyanobacteria (Cornejo and Beale (1997) Photosynth. Res. 51: 223-230). In 
contrast with mammals and phycobiliprotein-containing organisms, plants and green algae 
reduce BV to 3Z-PΦB by the ferredoxin-dependent enzyme PΦB synthase, which targets 
the 2,3,3¹,3²-diene system for reduction (Terry et al. (1995) J. Biol. Chem. 270: 11111- 
11118; Wu et al. (1997) J. Biol. Chem. 272: 25700-25705). In plants, 3Z-PΦB is 
isomerized to its 3E isomer, which appears to be the immediate precursor of the 
phytochrome chromophore (Terry et al. (1995) J. Biol. Chem. 270: 11111-11118). The 
green alga Mesotaenium caldariorum possesses a second bilin reductase activity that 
catalyzes the reduction of the 18-vinyl group of PΦB to produce 3Z-PCB (Wu et al. (1997) 
J. Biol. Chem. 272: 25700-25705). These investigations also revealed that 3E-PCB is the 
natural phytochrome chromophore precursor in this organism. 

Despite the extensive biochemical analysis of the phytobilin biosynthetic 
pathways in plants, algae, and cyanobacteria, the low levels of bilin reductase expression 
have hindered efforts to clone these enzymes. Using a genetic approach, the HY2 locus of 
Arabidopsis, which encodes the enzyme PΦB synthase, was cloned (Example 1). 

The studies reported here were undertaken to identify HY2-related genes in 
the protein and nucleic acid databases. Using cloning, expression, and biochemical 
characterization, our investigations revealed three new classes of ferredoxin-dependent bilin 
reductases with either unique substrate or product specificities. 

Results. 

The HY2-related gene family in cyanobacteria, oxyphotobacteria, and plants 

Example 1 describes the cloning of the HY2 gene of Arabidopsis. Using the 
deduced protein sequence of HY2, TBLASTN, BLASTP, and PSI-BLAST searches 
(Altschul et al. (1990) J. Mol. Biol. 215: 403-410; Altschul et al. (1997) Nucleic Acids Res. 
25: 3389-3402) were performed to identify putative bilin reductases in the nonredundant 
National Center for Biotechnology Information database, in CyanoBase (Nakamura et al. 
(2000)), and in the Joint Genome Institute Microbial Genome database 
(http://spider.jgi-psf.org/JGI_microbial/html). These searches identified 15 putative 
proteins from various photosynthetic bacteria and two known proteins from plants. 

Figure 10 shows a multiple sequence alignment of this family of proteins 
using CLUSTAL W (Higgins et al. (1996) Methods Enzymol. 266: 383-402), hand adjustment 
with MEME (Bailey and Elkan (1995) pp. 21-29 in: Proc. Third Internat. Conf. on 
Intelligent Systems for Molecular Biology, Menlo Park, CA: American Association of 
Artificial Intelligence Press), and highlighting with GENEDOC 
(http://www.psc.edu/biomed/genedoc). This alignment revealed regions of strong similarity 
interspersed with highly diverged regions, with an average pairwise similarity score of 25%. 
No sequence similarity of these proteins was observed with mammalian biliverdin 
reductases. 
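
The text does not state how the average pairwise similarity score was computed. Purely as an illustration of the general idea, the sketch below averages pairwise percent identity (a simpler measure than a matrix-based similarity score) over all sequence pairs in an alignment. The function names and the toy aligned sequences are hypothetical and are not taken from Figure 10.

```python
from itertools import combinations


def pairwise_identity(a: str, b: str, gap: str = "-") -> float:
    """Fraction of identical residues over columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if x == y:
            matches += 1
    return matches / compared if compared else 0.0


def average_pairwise_identity(aligned: list) -> float:
    """Mean of pairwise_identity over every unordered pair of sequences."""
    pairs = list(combinations(aligned, 2))
    if not pairs:
        raise ValueError("need at least two aligned sequences")
    return sum(pairwise_identity(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical toy alignment, for demonstration only.
    toy_alignment = ["MKV-LA", "MKVTLA", "MRV-IA"]
    print(f"{100 * average_pairwise_identity(toy_alignment):.1f}% average identity")
```

A similarity score as reported above would additionally credit conservative substitutions, typically through a substitution matrix.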

On the basis of the biochemical data presented here, we name these HY2- 
related cyanobacterial loci after their roles in the biosynthesis of PCB (i.e., pcyA) and PEB 
(i.e., pebA and pebB). One of these proteins, the product of locus slr0116 (i.e., pcyA) in the 
genome of the cyanobacterium Synechocystis sp PCC6803, appears to be part of an operon 
with a putative response regulator located 62 bp upstream (Ashby and Mullineaux (1999) 
FEMS Microbiol. Lett. 181: 253-260). Interestingly, this response regulator belongs to the 
OmpR subfamily for which a mutation (ycf27) was shown to cause a reduced energy 
transfer from the phycobilisomes to photosystem I (Ashby and Mullineaux (1999) FEMS 
Microbiol. Lett. 181: 253-260). pcyA-related open reading frames (orfs) also were found in 
the oxyphotobacterium Prochlorococcus sp. MED4 (CCMP1378), which is also known as 
Prochlorococcus marinus MED4, in the marine cyanobacterium Synechococcus sp 
WH8102, and in the nitrogen-fixing, heterocyst-forming filamentous cyanobacteria 
Anabaena sp PCC7120 and Nostoc punctiforme. Among the other identified HY2-related 
genes are two orfs, orf236 and orf257, from the marine cyanobacterium Synechococcus sp 
WH8020 that lie adjacent to each other within the major phycobiliprotein gene cluster 
(Wilbanks and Glazer (1993) J. Biol. Chem. 268: 1226-1235; Wilbanks and Glazer (1993b) 
J. Biol. Chem. 268: 1236-1241). These orfs, which encode the proteins Ycp2_SYNPY and 
Ycp3_SYNPY, appear to be part of a three-gene operon containing an upstream orf of 
unknown function, orf200. A similar operon was identified in Synechococcus sp WH8102. 
The genomes of N. punctiforme and Prochlorococcus, both the MED4 and SS120 
(CCMP1375) subspecies, also contain similar operons. In contrast to the N. punctiforme 
and Anabaena operons, an upstream orf in the Prochlorococcus operons exhibits a striking 
similarity to the ferredoxin-dependent heme oxygenase gene HY1 (Davis et al. (1999) Proc. 
Natl. Acad. Sci. USA 96: 6541-6546; Muramoto et al. (1999) Plant Cell 11: 335-347) and 
its homologs in the cyanobacterium Synechocystis sp PCC6803 (Cornejo et al. (1998) Plant 
J. 15: 99-107). On the basis of their roles in PEB biosynthesis shown in this study, we 
name these ORFs pebA and pebB. 

PSI-BLAST iterations also identified a weak relatedness of HY2 to the red 
chlorophyll catabolite reductase (RCCR) from barley and Arabidopsis. RCCR is involved 
in chlorophyll catabolism and catalyzes the ferredoxin-dependent reduction of the linear 
tetrapyrrole, red chlorophyll catabolite (RCC), to yield the primary fluorescent chlorophyll 
catabolite (Wüthrich et al. (2000) Plant J. 21: 189-198). These investigators showed that 
RCCR was incapable of reducing BV to either bilirubin or PΦB (Wüthrich et al. (2000) Plant 
J. 21: 189-198). Interestingly, the sequence similarity between RCCR and the other HY2- 
related proteins is so weak that TBLASTN searches using the two RCCR sequences failed 
to identify HY2 or other HY2-related proteins present in the publicly available databases 
(Wüthrich et al. (2000) Plant J. 21: 189-198). This divergence undoubtedly reflects the 
unusual substrate specificity of the RCCR for bilins derived from chlorophyll catabolism. 

Phylogenetic analysis of the HY2-related family of proteins was performed 
using a heuristic parsimony search with a modified PAM250 weighting matrix and the 
program PAUP* version 4.0 (see Methods). A single tree obtained with this analysis (Fig 
11) revealed four clades of HY2-related proteins with strong bootstrap support: PcyA, 
PebA, PebB, and RCCR. We noted that HY2 lies within the PebB clade. 

Recombinant HY2-Related Proteins Are Bilin Reductases 

The HY2-related cyanobacterial orfs were amplified by polymerase chain 
reaction and cloned into the Escherichia coli expression vector pGEX-6P-1, which is 
similar to the vector described for mHY2 (Example 1). With this vector, the proteins were 
expressed as glutathione S-transferase (GST) fusions, which enabled their purification by 
affinity chromatography. The GST tag was removed via site-specific protease digestion, 
which resulted in an additional five to eight N-terminal amino acids due to the cloning 
strategy. Figure 12 shows SDS-PAGE results of purified recombinant protein 
representatives of the PcyA, PebA, PebB, and HY2 subfamilies. One liter of bacterial 
culture yielded between 1 and 10 mg of soluble recombinant protein depending on which 
protein was expressed. The deduced molecular masses of the recombinant processed 
proteins, confirmed by SDS-PAGE (Fig 12), are as follows: Anabaena PcyA, 28.7 kD; 
Synechocystis PcyA, 28.9 kD; Synechococcus sp WH8020 PebA, 28 kD; Synechococcus sp 
WH8020 PebB, 30.3 kD; and Arabidopsis mHY2, 33.4 kD. 

To determine whether the recombinant HY2-related proteins possess bilin 
reductase activity, we used a coupled holophytochrome assembly assay to analyze crude 
protein extracts from E. coli expressing these proteins for their ability to convert BV to 
phytobilins under standard PΦB synthase assay conditions. Figure 13A shows that crude 
bacterial lysates containing GST fusions of mHY2, PcyA_SYNY3, and PcyA_ANASP (the 
last not shown) all exhibited BV reductase activities, yielding phytobilin products that could 
combine with the cyanobacterial phytochrome Cph1 apoprotein (apoCph1) to yield 
phytochrome difference spectra. The bilin metabolites incubated with apoCph1 resulted in 
different maxima and minima, suggesting that the various proteins reduced BV to distinct 
products. Both PcyA-containing extracts produced a BV metabolite(s) that gave spectra 
identical to those of the PCB adduct of apoCph1, with difference maxima at 655 nm and 
minima at 705 nm (Yeh et al. (1997) Science 277: 1505-1508). Figure 13A shows that both 
difference peaks of the mHY2 metabolites were markedly red shifted, with maxima at 670 
and 730 nm, which is characteristic of the PΦB adduct of apoCph1 (Yeh et al. (1997) 
Science 277: 1505-1508; Example 1). Identical results were obtained using the purified 
recombinant HY2 and PcyA proteins (data not shown). Similarly, E. coli extracts lacking 
HY2 or PcyA proteins failed to metabolize BV to bilin products that could functionally 
assemble with apoCph1 (data not shown). 

In contrast to the results for PcyA and HY2, no phytochrome difference 
spectrum was observed when the BV metabolites from reactions containing PebA_SYNPY, 
PebB_SYNPY, or a 1:1 mixture of the two Synechococcus-derived proteins were incubated 
with apoCph1. To determine whether fusion to GST is responsible for inhibiting the 
enzyme activity of these proteins, GST was removed by protease digestion and the full- 
length proteins were purified (Figure 12). Neither the purified proteins nor the 1:1 mixture 
of PebA and PebB were able to convert BV to a bilin product(s) that yielded a photoactive 
adduct with apoCph1 (Fig 13A). The observation that coincubation of a 1:1 ratio of PebA 
and PebB with BV elicited a color change of the assay mixture from bluish-green to pink 
suggested that these proteins converted BV to bilins unable to form a photoconvertible 
holophytochrome. It is noteworthy that this pronounced color change was not observed 
when either PebA or PebB was assayed separately. This strongly implied that the 
PebA/PebB mixture could convert BV to PEB, the precursor of the phycobiliprotein C- 
phycoerythrin. 

To test this hypothesis, BV-derived bilin metabolites from PebA, PebB, and 
PebA/PebB were incubated with apoCph1, and the mixtures were analyzed 
spectrofluorometrically for the production of the fluorescent PEB-apoCph1 "phytofluor" 
adducts (Murphy and Lagarias (1997) Curr. Biol. 7: 870-876). Only the PebA/PebB 
product mixture yielded a highly fluorescent compound, whose excitation and emission 
spectra were consistent with the formation of a phytofluor (Figure 13B). This result 
suggested that PebA and PebB were both required for the conversion of BV to PEB. 

HPLC Reveals Distinct Substrate/Product Specificity for Each Member of the 
HY2 Family 

HPLC analysis was performed to identify the bilin metabolites of the HY2 
family members using a chromatographic system that is able to separate 3E and 3Z isomers 
of PΦB, PCB, and PEB. As shown in Example 1, recombinant mHY2 efficiently reduced BV 
by two electrons to yield a mixture of both isomers of PΦB. In comparison, both PcyA 
proteins converted BV to a mixture of the 3E and 3Z isomers of PCB, a four-electron 
reduction (Fig 14). A time-course experiment was performed and revealed no evidence for 
other colored bilin intermediates (data not shown). Incubation of PebA_SYNPY with BV 
resulted in the formation of an early eluting product that was detectable only at 560 nm and 
not at 380 nm (Fig 14). Optical spectroscopy revealed that this product had an absorption 
maximum at 575 nm in acetone:20 mM formic acid (50:50, v/v) (data not shown). Based 
on its absorption spectrum (data not shown), early retention time, and results shown below, 
this product was determined to be 15,16-DHBV. A similar absorption spectrum for DHBV 
has previously been published (Beale and Cornejo (1991b) J. Biol. Chem. 266: 22333- 
22340). In contrast to PebA_SYNPY, PebB_SYNPY was unable to metabolize BV (Figure 
14). Identical results were observed with the N. punctiforme PebA and PebB homologs 
(data not shown). 

A mixture of PebA and PebB effectively converted all of the BV to two 
colored pigments, one purple (retention time 9 min) and the other pink (retention time 10.5 
min), whose retention times differed from that of 15,16-DHBV (Figure 14). Both bilin 
metabolites have absorption maxima in acetone:20 mM formic acid (50:50, v/v) near 580 
nm (data not shown). Because PebB could not metabolize BV, these results suggest either 
that the 15,16-DHBV product of PebA was metabolized by PebB to the purple and pink 
bilins or that PebB forms a complex with PebA to alter its product profile. That 15,16- 
DHBV was a substrate for PebB was demonstrated by incubation of PebB with HPLC- 
purified 15,16-DHBV. In this case, the same two bilin products were observed (data not 
shown). HPLC coelution experiments showed the purple and pink pigments to be the 3E 
and 3Z isomers of PEB, respectively (data not shown). Both pigments are chemically stable 
in the HPLC mobile phase, eluting as single peaks after purification and reinjection. 
Moreover, both HPLC-purified pigments form phytofluors upon incubation with apoCph1, 
indicating that these are configurational isomers of PEB. HPLC-purified 15,16-DHBV from 
the PebA-mediated reduction of BV, however, was unable to form a fluorescent adduct with 
apoCph1 (data not shown). 

Biochemical studies of ferredoxin-dependent bilin reductases from algae and 
plants indicated that the 3Z isomers of PEB, PCB, and PΦB were the primary metabolites of 
these enzymes, with the formation of the 3E isomer requiring distinct bilin isomerase(s) 
(Beale and Cornejo (1991a) J. Biol. Chem. 266: 22328-22332; Beale and Cornejo (1991b) 
J. Biol. Chem. 266: 22333-22340; Beale and Cornejo (1991c) J. Biol. Chem. 266: 22341- 
22345; Cornejo and Beale (1997) Photosynth. Res. 51: 223-230). Our results show that 
both bilin isomers are produced with recombinant HY2, PcyA, and PebA/PebB proteins. 
We believe that the production of the 3E isomers occurred because of the presence of 
glutathione in the assay mixture and because of heating in the Speed-Vac concentrator. In 
this regard, glutathione-mediated 3Z to 3E isomerization of phycobilins has been reported 
for bilin reductases from C. caldarium (Beale and Cornejo (1991) J. Biol. Chem. 266: 
22341-22345). Preliminary experiments performed with GST fusion proteins that did not 
come in contact with reduced glutathione or with proteins that were eluted from the affinity 
column by protease digestion greatly increased the relative amount of 3Z isomers produced 
(data not shown). Heating that occurred during concentration also contributed to the 
formation of the 3E isomers. If the drying time was reduced, only 3Z isomers were detected 
(data not shown). Therefore, we conclude that in all cases the 3Z isomers are the primary 
reaction product of these reductases and that the production of the 3E isomers occurs by 
non-enzyme-mediated side reactions caused by heat and reduced glutathione. 

The HY2 Family of Bilin Reductases Are Ferredoxin Dependent 

All of the reductive interconversions of BV and 15,16-DHBV presented here 
were dependent on reduced ferredoxin, which necessitated the inclusion of 
ferredoxin:NADP+ oxidoreductase and an NADPH-regenerating system in the assay 
mixture. Indeed, none of the reduced bilin metabolites were detectable via HPLC when 
either ferredoxin or the NADPH-regenerating system was omitted from the assay mixture 
(data not shown). These results are in agreement with the ferredoxin dependence of the 
bilin reductases from plants and algae (Beale (1993) Chem. Rev. 93: 785-802). Thus, this 
family of proteins constitutes a new class of bilin:ferredoxin oxidoreductases (EC 1.3.7.n). 

Discussion. 

Using a combination of protein-based pattern searches of genomic databases, 
phylogenetic analysis, and biochemical characterization, these investigations establish that 
the HY2 family of ferredoxin-dependent bilin reductases can be subdivided into five 
classes: PcyA, PebA, PebB, HY2, and RCCR families (Figure 11). This classification 
system is supported by the distinct substrate preference and double-bond regiospecificity of 
representative members of each bilin reductase subfamily. PcyA, PebA, and HY2 all 
recognize BV as a substrate, yet each yields different bilin products. PebB and RCCR 
possess unique bilin substrates (i.e., 15,16-DHBV and RCC, respectively), and neither 
metabolizes BV (Wuthrich et al. (2000) Plant J. 21: 189-198; this study). Biochemical 
analyses of representatives of the three new classes of bilin reductases identified here, 
PcyA, PebA, and PebB, document their involvement in the biosynthesis of the 
phycobiliprotein chromophore precursors PCB and PEB. 
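
The classification described above can be summarized as a small lookup table. The following Python sketch is illustrative only: the table name, the function names, and the choice to leave the RCCR product unspecified (it is not named in this section) are assumptions made for the example and are not part of the reported work.

```python
from collections import defaultdict, deque

# Substrate and product of each class, as stated in the text.
# The RCCR product is not named in this section, so it is left as None.
BILIN_REDUCTASES = {
    "PcyA": {"substrate": "BV", "product": "PCB"},
    "PebA": {"substrate": "BV", "product": "15,16-DHBV"},
    "PebB": {"substrate": "15,16-DHBV", "product": "PEB"},
    "HY2": {"substrate": "BV", "product": "PΦB"},
    "RCCR": {"substrate": "RCC", "product": None},
}


def classes_by_substrate(table=BILIN_REDUCTASES):
    """Group the enzyme classes by the bilin substrate they recognize."""
    groups = defaultdict(list)
    for name, info in table.items():
        groups[info["substrate"]].append(name)
    return {substrate: sorted(names) for substrate, names in groups.items()}


def route(start, goal, table=BILIN_REDUCTASES):
    """Breadth-first search for the enzymes that convert start into goal."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        bilin, enzymes = queue.popleft()
        if bilin == goal:
            return enzymes
        for name, info in table.items():
            product = info["product"]
            if info["substrate"] == bilin and product is not None and product not in seen:
                seen.add(product)
                queue.append((product, enzymes + [name]))
    return None  # no route through the listed enzymes


if __name__ == "__main__":
    print(classes_by_substrate())
    print(route("BV", "PEB"))
    print(route("BV", "PCB"))
```

In this sketch, three classes share BV as a substrate, the route from BV to PEB passes through PebA and then PebB, and the route from BV to PCB uses PcyA alone.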

The PcyA Family of Ferredoxin-Dependent BV Reductases Plays a Key Role in PCB Biosynthesis 

In this investigation, we have documented that the pcyA genes of the 
cyanobacteria Synechocystis sp PCC6803, Anabaena sp PCC7120, and N. punctiforme (data 
not shown) encode bilin reductases that catalyze the four-electron reduction of BV to 3Z-PCB. 
PCB is the precursor of the chromophores of the phycobiliproteins phycocyanin and 
allophycocyanin, which are abundant in all three cyanobacteria. PcyA enzymes are atypical 
bilin reductases because all others catalyze two-electron reductions. Formally, these 
enzymes catalyze two-electron reductions of both the A and D rings of BV; however, we 
have not detected the production of semi-reduced intermediates such as PΦB and 15,16-DHBV. 
Thus, it appears that the partially reduced intermediates are tightly bound to the 
enzyme. The direct conversion of BV to PCB in these cyanobacteria is in contrast to the 
proposed pathways of PCB biosynthesis in the red alga C. caldarium, which involves the 
intermediacy of PEB (Beale (1993) Chem. Rev. 93: 785-802), and in the green alga M. 
caldariorum, in which 3Z-PΦB is an isolable intermediate (Wu et al. (1997) J. Biol. Chem. 
272: 25700-25705). pcyA-related genes also are present in the oxyphotobacterium 
Prochlorococcus sp MED4, an unanticipated observation in view of the lack of 
phycobiliproteins in this organism. Phylogenetic analyses place this oxyphotobacterial 
protein in the PcyA clade of PCB:ferredoxin oxidoreductases. We were also able to clone 
the Prochlorococcus sp MED4 pcyA gene and express it as an N-terminal GST fusion. We 
determined that recombinant PcyA_PROME was able to reduce BV to PCB in our standard 
phytochrome-based assay (data not shown). It therefore possesses the same enzymatic 
activity as all other studied PcyA enzymes. 

peb Operons Encode Bilin Reductases Involved in PEB Biosynthesis 

We have observed that the pebA and pebB genes of the cyanobacteria 
Synechococcus sp WH8020 and N. punctiforme encode bilin reductases that catalyze the 
conversions of BV to 15,16-DHBV and 15,16-DHBV to 3Z-PEB, respectively (Fig 1). 
PebA therefore is a 15,16-DHBV:ferredoxin oxidoreductase, whereas PebB is a 
3Z-PEB:ferredoxin oxidoreductase. Both activities are consistent with the pathway of PEB 
biosynthesis in the red alga C. caldarium (Beale (1993) Chem. Rev. 93: 785-802). The two 
peb genes also are found in the same operon in both phycoerythrin-producing 
cyanobacteria, and their close association with the major phycobiliprotein gene clusters 
supports their role in phycobilin biosynthesis (Wilbanks and Glazer (1993) J. Biol. Chem. 
268: 1236-1241). We hypothesize that PebA and PebB function as a dual enzyme complex, 
in view of the synergistic metabolism of BV observed when the two enzymes are 
coincubated. A peb operon is not present in the genome of the cyanobacterium 
Synechocystis sp PCC6803, an organism that lacks phycoerythrin. This strongly suggests 
that PCB is synthesized in this cyanobacterium via the PcyA-dependent pathway, as 
opposed to the PEB pathway found in C. caldarium (Beale (1993) Chem. Rev. 93: 785-802). 
In this regard, biochemical analyses of crude extracts from Synechocystis sp 
PCC6803 provide no evidence for the production of PEB (Cornejo and Beale (1997) 
Photosynth. Res. 51: 223-230). 

The MED4 and SS120 subspecies of the oxyphotobacteria Prochlorococcus 
also possess peb operons very similar to those of Synechococcus sp WH8020 and WH8102, 
except that the former possess upstream genes related to heme oxygenase. This strongly 
suggests that both oxyphotobacterial subspecies can synthesize PEB. In this regard, genes 
encoding the α and β subunits of a novel phycoerythrin have been identified in the SS120 
subspecies of Prochlorococcus (Hess et al. 1996; Hess et al. 1999). It 
also has been shown that this unusual phycoerythrin plays a role in light harvesting in this 
ecotype (Lokstein et al. (1999) Biochim. Biophys. Acta 1410: 97-98), which is adapted for 
photoautotrophic growth at great ocean depths where light is limited. This observation is 
consistent with the lack of phycoerythrin genes in the high light-adapted MED4 ecotype. 
Although the enzymatic activities of Prochlorococcus PebA and PebB have not been 
determined experimentally, our phylogenetic reconstructions suggest that these proteins 
may be functional orthologs of the Synechococcus and Nostoc enzymes. Further analysis of 
the bilin biosynthetic pathways in Prochlorococcus and marine cyanobacteria such as 
Synechococcus sp WH8020 will be interesting, because the shorter wavelength-absorbing 
phycourobilin chromophores are major constituents of their phycoerythrins (Ong and Glazer 
(1991) J. Biol. Chem. 266: 9515-9527; Hess et al. (1996) Proc. Natl. Acad. Sci. USA 93: 
11126-11130). Although we have identified PCB and PEB biosynthetic enzymes in both 
organisms, it remains to be determined whether either these or other enzymes play a role in 
phycourobilin biosynthesis. 

Phycobilin Isomerases: Are They Necessary? 

PcyA, HY2, and PebB mediate bilin reductions that yield the 3Z isomer of 
their respective products. Because numerous studies have established that the more 
thermodynamically stable 3E isomers are substrates for assembly of the phycobiliprotein 
and phytochrome holoproteins, it has been proposed that there are unique 3Z/3E isomerases 
that mediate this interconversion (Beale (1993) Chem. Rev. 93: 785-802; Terry et al. (1993) 
Arch. Biochem. Biophys. 306: 1-15). It should be noted that the 3Z isomer of PΦB has been 
shown to be a substrate for apophytochrome (Terry et al. (1995) J. Biol. Chem. 270: 
11111-11118); however, these investigators suggested that isomerization to the 3E isomer is 
necessary to yield the correct stereochemistry of the holophytochrome chromophore. Such 
an isomerase activity has been identified in extracts of the red alga C. caldarium; however, 
this reaction also can be mediated by reduced glutathione (Beale and Cornejo (1991) J. Biol. 
Chem. 266: 22333-22340). For this reason, the need for a 3Z/3E isomerase has been 
questioned. All of the hy mutant loci have now been cloned from Arabidopsis, and none of 
these genes appear to encode a bilin isomerase. Thus, the isomerization of 3Z-PΦB may 
occur chemically or may be catalyzed by a genetically redundant family of bilin isomerases. 

X-ray crystallographic analyses of phycobiliproteins have revealed that the 
stereochemistries of the thioether linkages to distinct cysteine residues are not all the same 
(Schirmer et al. (1987) J. Mol. Biol. 196: 677-695; Schmidt et al. (1987) Z. Naturforsch. 
42C: 845-848). Therefore, we propose that the different stereochemistries arise from the 
use of the 3Z and 3E isomers of the phycobilin precursor as substrates for assembly to 
distinct cysteinyl moieties. Beale and Cornejo (1991) J. Biol. Chem. 266: 22333-22340, 
have identified a bilin isomerase that catalyzes the conversion of 3Z-PEB to 3Z-PCB in C. 
caldarium, which appears to be the sole pathway for PCB biosynthesis in this organism. 
More recently, a lyase/isomerase from the cyanobacterium Mastigocladus laminosus was 
described that is involved in both the isomerization of PCB to phycoviolobilin and its 
covalent attachment to apophycoerythrocyanin (Zhao et al. (2000) FEBS Lett. 469: 9-13). 
On the basis of these results and the diversity of bilin isomers found in phycobiliproteins 
from marine cyanobacteria, cryptomonads, and oxyphotobacteria (Ong and Glazer (1991) 
J. Biol. Chem. 266: 9515-9527; Hess et al. (1996) Proc. Natl. Acad. Sci. USA 93: 11126- 
11130; Wedemayer et al. (1996) Photosynth. Res. 48: 163-170), it is likely that numerous 
bilin isomerases are present in these oxygen-evolving photosynthetic organisms. 

Molecular Evolution of the HY2 Family of Bilin Reductases 

A single phylogenetic tree that is well supported with bootstrap replicates 
was obtained for the HY2 family (Fig 11). This tree delineates four clades of bilin 
reductases, which is in good agreement with the enzymes' double-bond specificity for 
reduction. HY2 appears most closely related to the PebB clade of enzymes that catalyze a 
reduction of 15,16-DHBV to PEB. We predict that phytochromobilin synthases, because of 
their exquisite BV substrate specificity, will form a distinct clade when HY2 orthologs from 
other plant species are identified. The relatedness of HY2 and PebB enzymes is reasonable 
because both families mediate reduction of the vinyl pyrrole A ring to form the ethylidene 
moiety. We speculate that these two classes arose from a common ancestor that used BV as 
a substrate. This notion is based on the observation that the PebA family of bilin reductases 
also recognizes BV as a substrate. 

Unlike HY2 and PebB, members of the PebA family target the 15,16 double 
bond of BV for reduction. To evolve the PebA and PebB/HY2 subfamilies, we envisage a 
duplication of a ferredoxin-dependent BV reductase gene and subsequent divergence in a 
marine cyanobacterium growing in a light-limited environment. Such an environment 
would provide the selection pressure favoring evolution of the biosynthetic pathway for 
PEB, whose incorporation into phycoerythrin extends the light-harvesting wavelength range 
of their phycobilisomes. Depending on the rooting of the HY2 family tree, the comparative 
branch lengths of the PebA and PebB/HY2 families on the phylogenetic tree suggest that 
the A ring reductases are more ancient, with the 15,16 reductases evolving more recently. 
On the basis of these inferences, we speculate that a cyanobacterial progenitor of plant 
chloroplasts possessed a bilin reductase with an A ring reductase regiospecificity. The 
progenitor of present day cyanobacteria likely would have possessed the ability to 
synthesize PCB, an essential component of their allophycocyanin-containing phycobilisome 
core. Thus, the common pebA/pebB ancestor might have resembled present-day pcyA 
genes, which encode atypical BV reductases that catalyze the four-electron reduction of BV 
to PCB. To date, pcyA genes appear to be present in all cyanobacteria, whereas a peb 
operon is lacking in the phycoerythrin-deficient cyanobacterium Synechocystis sp 
PCC6803. 

The role of the pebA, pebB, and pcyA genes in Prochlorococcus sp MED4 
remains a mystery. Members of this genus are distinguished by the presence of integral 
membrane antennae complexes that contain divinyl chlorophyll a2 and b2 and by the lack of 
phycobilisomes (Partensky et al. (1999) Microbiol. Mol. Biol. Rev. 63: 106-127). 
Functional phycoerythrins have been detected only for the SS120 subspecies. As such, 
these organisms have been thought by some to be descendants of the class of prokaryotic 
photosynthetic organisms whose endosymbiosis led to higher plant chloroplasts. 
Phylogenetic analyses using 16S rRNA indicate that this probably is not the case, because 
Prochlorococcus species appear more similar to marine Synechococcus species than to 
chloroplasts (Urbach et al. (1998) J. Mol. Evol. 46: 188-201). These analyses also suggest 
that Prochlorococcus evolved more recently from a phycobilisome-containing ancestor that 
resembled a marine Synechococcus species. The need for pebA, pebB, and pcyA genes for 
phycobilin biosynthesis in this ancestor is self-evident, and such genes may not yet have 
been lost from Prochlorococcus species. It is conceivable that these BV reductases are 
required to make bilin chromophore precursors of light receptors, such as the phytochromes 
(Hughes and Lamparter (1999) Plant Physiol. 121: 1059-1068). Although phytochrome-like 
genes are abundant in some cyanobacterial genomes, none are present in the genome of 
Prochlorococcus sp MED4 (data not shown). Alternatively, BV reductases may be needed 
to drive heme oxygenase, whose role in iron metabolism is well documented (Poss and 
Tonegawa (1997a) Proc. Natl. Acad. Sci. USA 94: 10919-10924; Poss and Tonegawa 
(1997b) Proc. Natl. Acad. Sci. USA 94: 10925-10930; Richaud and Zabulon (1997) Proc. 
Natl. Acad. Sci. USA 94: 11736-11741; Schmitt (1997) J. Bacteriol. 179: 838-845). 

In addition to the bilin reductases involved in phytobilin biosynthesis, a 
separate class exists of bilin reductases that are involved in chlorophyll degradation 
(Hortensteiner et al. (1998) J. Biol. Chem. 273: 15335-15339; Wuthrich et al. (2000) Plant 
J. 21: 189-198). The pathway of chlorophyll degradation that occurs during plant 
senescence is similar to the heme degradation pathway (Matile and Hortensteiner (1999) 
Annu. Rev. Plant Physiol. Plant Mol. Biol. 50: 67-95). After dephytylation and magnesium 
removal, the chlorophyll macrocycle ring is opened by a monooxygenase that has yet to be 
cloned (Hortensteiner et al. (1998) J. Biol. Chem. 273: 15335-15339). This is followed by 
a ferredoxin-dependent reduction of the bilin product catalyzed by the RCCR (Hortensteiner 
et al. (2000) Plant Biol. 2: 63-67; Wuthrich et al. (2000) Plant J. 21: 189-198). RCCRs are 
the most diverged members of the ferredoxin-dependent bilin reductase family. Indeed, 
these enzymes have markedly different substrate specificities. It is notable that RCCRs 
catalyze a reduction very similar to that mediated by the PebA family (i.e., a 15,16 
double-bond reduction). The structural determinants that are responsible for RCCR's unique 
substrate specificity and double-bond regiospecificity will be interesting to discover. 
Presumably, chlorophyll catabolism would be important for chlorophyll-containing 
prokaryotes; however, to date, RCCR genes are not readily identifiable in the genomes of 
any photosynthetic prokaryotes. It is possible that RCCR genes were lost, or alternately, 
that they evolved more recently from an HY2-like gene in the chloroplast endosymbiont 
progenitor, because they are found in cryptogams and plants (Hortensteiner et al. (2000) 
Plant Biol. 2: 63-67). 

Mechanistic Implications 

Ferredoxin-dependent bilin reductases catalyze two- and four-electron 
reductions of linear tetrapyrroles. Because ferredoxin is a one-electron carrier, these 
enzymes are mechanistically quite different from the NAD(P)H-dependent BVR/BvrD 
family of BV reductases. Preliminary analyses to date have failed to identify a metal or 
flavin cofactor in any of the recombinant enzymes reported here, suggesting that electrons 
are transferred directly to the bilin moiety, possibly via reduction of an amino acid residue 
within the enzyme. Although this finding suggests the presence of bilin radical 
intermediates, additional experiments are needed to assess this hypothesis. The oxygen 
sensitivity of RCCR supports the hypothesis that bilin radicals, which react with molecular 
oxygen, are produced during RCC catalysis (Wuthrich et al. (2000) Plant J. 21: 189-198). 
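
The electron bookkeeping implied by this paragraph can be sketched in a few lines of Python. The sketch assumes only what the text states (PcyA catalyzes a four-electron reduction, the other classes catalyze two-electron reductions, and ferredoxin carries one electron) plus the general fact that NADPH donates two electrons; the dictionary and function names are hypothetical.

```python
# Electrons consumed per substrate turnover, as described in the text.
ELECTRONS_PER_TURNOVER = {"PcyA": 4, "PebA": 2, "PebB": 2, "HY2": 2, "RCCR": 2}

ELECTRONS_PER_FERREDOXIN = 1  # ferredoxin is a one-electron carrier
ELECTRONS_PER_NADPH = 2       # NADPH is a two-electron donor


def reduced_ferredoxins_needed(enzymes):
    """Number of reduced ferredoxin molecules consumed by a series of steps."""
    electrons = sum(ELECTRONS_PER_TURNOVER[name] for name in enzymes)
    return electrons // ELECTRONS_PER_FERREDOXIN


def nadph_equivalents_needed(enzymes):
    """NADPH equivalents needed to regenerate that much reduced ferredoxin."""
    electrons = sum(ELECTRONS_PER_TURNOVER[name] for name in enzymes)
    return electrons / ELECTRONS_PER_NADPH


if __name__ == "__main__":
    for steps in (["PcyA"], ["PebA", "PebB"], ["HY2"]):
        print(steps, reduced_ferredoxins_needed(steps), nadph_equivalents_needed(steps))
```

Under these assumptions, converting one BV to PCB through PcyA and converting one BV to PEB through PebA and PebB each consume four reduced ferredoxins, so each bilin reduction requires at least two separate one-electron transfers.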

Examination of highly conserved residues in the entire HY2 family and those 
within each of the five classes of bilin reductases provides information regarding residues 
important to the protein structure, ferredoxin interaction site, and substrate/product 
specificity. In this regard, only a small number of residues are conserved in the entire HY2 
family of enzymes. These include hydrophobic residues at positions 137, 157, 158, 256, 
and 314, Pro-151, Phe-221, Ser-222, and Asp-171 (Figure 10). The notable lack of 
conserved basic residues suggests that the propionyl moieties of the bilin substrates do not 
form salt linkages with the enzymes. The conserved hydrophobic residues proline and 
phenylalanine are likely to be involved in overall protein structure (i.e., folding). 
Alternately, they may form hydrophobic interactions with conserved regions of the various 
bilin substrates. The loss-of-function hy2-1 and hy2-104 alleles of phytochromobilin 
synthase from Arabidopsis support the critical role of Pro-151 in HY2's structure. The 
conserved serine and aspartate residues likely play catalytic roles, such as hydrogen bonding 
with the substrate and/or substrate protonation to make the bound bilin a better electron 
acceptor. 

Despite the wide divergence of the HY2 family, we believe that these 
conserved residues indicate that the active sites of all members of this class are similar. We 
speculate that the distinct double-bond reduction specificities of the BV reductases (i.e., 
PcyA, PebA, HY2), the 15,16-DHBV reductases (i.e., PebB), and the RCCR families reflect 
the positioning of the respective substrates within the catalytic pocket. Because the A/B and 
C/D rings of BV are very similar but not identical, it is conceivable that the substrate 
binding sites of the PebA and HY2 enzymes are tailored to position BV in opposite 
orientations, favoring electron transfer to the bilin C/D ring or A ring, respectively. If this is 
true, then the PebB class might tether its 15,16-DHBV substrate in an orientation similar to 
that of the HY2 family, whereas RCC might be bound to RCCR in a manner similar to that 
in which BV is bound to PebA. Future studies will address the unique substrate/product 
specificity using domain swapping, site-directed mutagenesis, synthetic biliverdin 
substrates, and x-ray crystallography. 



Biotechnological Implications 

The availability of genes for bilin reductases that mediate the biosynthesis of 
PΦB, PCB, and PEB provides us with useful tools for numerous biotechnological 
applications. Engineering the biosynthesis of PEB in any BV-producing 
organism is now feasible via the introduction of one or two genes. In this way, phytobilins 
potentially can be produced in any ferredoxin-containing organism. Coexpression of bilin 
reductase genes with apophytochromes should enable us to produce holophytochromes in 
bacteria and yeast. This will facilitate not only three-dimensional structural analysis of 
phytochrome but also the reconstruction of phytochrome signaling in a nonplant system in 
which we can exploit the power of molecular genetic analyses. This approach has proven 
invaluable for the structure-function analysis of the steroid hormone receptor family. By 
introducing the pcyA gene into wild-type and chromophore-deficient mutant plants, we also 
should be able to change the wavelength specificity of phytochrome, which may favorably 
alter plant growth and development in the field environment. Introduction of the pebA and 
pebB genes into plants potentially will shunt the conversion of BV to PEB, yielding 
photomorphogenetically challenged plants with fluorescent phytochromes. This would be 
especially useful for the analysis of the temporal and spatial patterns of phytochrome 
expression in plants. 

Methods. 

Reagents 

All chemicals, including glutathione agarose, were purchased from Sigma 
(St. Louis, MO) and were American Chemical Society grade or better. Restriction enzymes 
and Taq polymerase were from Gibco BRL (Cleveland, OH). HPLC-grade acetone and 80% 
formic acid were purchased from Fisher Scientific (Pittsburgh, PA). The expression vector 
pGEX-6P-1 and PreScission protease were obtained from Amersham Pharmacia Biotech 
(Piscataway, NJ). Centricon-10 concentrator devices were purchased from Amicon 
(Beverly, MA). 

Bioinformatics 

Protein and nucleic acid database searches were performed using programs at 
publicly available World Wide Web sites. Preliminary sequence data were obtained from 
the Department of Energy Joint Genome Institute 
(http://spider.jgi-psf.org/JGI_microbial/html/). Multiple sequence alignments were performed using the 
programs CLUSTAL W (Higgins et al. 1996), GENEDOC 
(http://www.psc.edu/biomed/genedoc), and MEME (Bailey and Elkan (1995) pp. 21-29 In 
Proceedings of the Third International Conference on Intelligent Systems for Molecular 
Biology, Menlo Park, CA: American Association of Artificial Intelligence Press) to guide 
hand alignments. Phylogenetic analysis of the HY2-related family of proteins based on the 
alignment shown in Fig 10 was conducted using a heuristic parsimony search with a 
modified PAM250 weighting matrix (Dayhoff et al. (1978) Pp 345-352 In: Atlas of Protein 
Sequences and Structure, M.O. Dayhoff, ed, Washington, DC: National Biomedical 
Research Foundation) using the program PAUP* version 4.0 (Swofford (1993) J. Gen. 
Physiol. 102: 9A). 

Because there are negative values in the PAM250 matrix, the most negative 
penalty was set equal to zero, and all other values were increased correspondingly. Scores 
for transitions to and from gaps were not defined in the original matrix; they were set equal 
to the most costly transition (25) defined in the matrix. Characters 1 to 65 and 323 to 368 in 
the alignment were excluded from our analysis because they correspond to N- and C- 
terminal extensions not common to all members of the HY2 family (i.e., plastid transit 
peptide found on HY2 and red chlorophyll catabolite reductase (RCCR), C-terminal 
extension found only on HY2). For Hordeum vulgare RCCR, missing characters 65 to 116 
were replaced with question marks, which were weighted as zero. A rescaled consistency 
index was used for character weighting. 
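
The matrix adjustment and character exclusion described above can be sketched as follows. This is a minimal illustration, not the procedure as implemented in PAUP*: the two-letter matrix holds made-up values rather than PAM250 entries, the function names are hypothetical, and the zero assigned to a gap-to-gap transition is an assumption that the text does not address.

```python
def shift_to_nonnegative(matrix):
    """Shift every entry so that the most negative value becomes zero."""
    lowest = min(value for row in matrix.values() for value in row.values())
    return {a: {b: value - lowest for b, value in row.items()}
            for a, row in matrix.items()}


def add_gap_scores(matrix, gap="-"):
    """Set every transition to or from a gap to the largest value in the matrix."""
    worst = max(value for row in matrix.values() for value in row.values())
    extended = {a: dict(row) for a, row in matrix.items()}
    for a in matrix:
        extended[a][gap] = worst
    extended[gap] = {b: worst for b in matrix}
    extended[gap][gap] = 0  # assumption: gap-to-gap is free; not stated in the text
    return extended


def included_characters(alignment_length, excluded=((1, 65), (323, 368))):
    """Return the 1-based alignment positions that remain after exclusion."""
    skipped = set()
    for first, last in excluded:
        skipped.update(range(first, last + 1))  # ranges are inclusive
    return [pos for pos in range(1, alignment_length + 1) if pos not in skipped]


if __name__ == "__main__":
    toy = {"A": {"A": -2, "R": 3}, "R": {"A": 3, "R": -1}}  # made-up values
    print(add_gap_scores(shift_to_nonnegative(toy)))
    kept = included_characters(368)
    print(kept[0], kept[-1], len(kept))
```

With the exclusion ranges given in the text and an alignment of 368 characters, positions 66 through 322 remain in the analysis.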

Construction of Expression Vectors 

HY2-related genes from Synechocystis sp PCC6803, Synechococcus sp 
WH8020, and Anabaena sp PCC7120 were amplified from chromosomal DNA via 
polymerase chain reaction using the following primers, which contained the indicated and 
underlined restriction sites: Synechocystis pcyA, BamHIfwd: 5'-AAG GAT CCA TGG 
CCG TCA CTG ATT TAA G-3' (SEQ ID NO:19), SalIrev: 5'-ACG CGT CGA CTA TTA 
TTG GAT AAC ATC AAA TAA GAC-3' (SEQ ID NO:20); Synechococcus pebA, 
EcoRIfwd: 5'-GGA ATT CAT CTT TGA TTC ATT TCT CAA TG-3' (SEQ ID NO:21), 
NotIrev: 5'-ATA GTT AGC GGC CGC TCA TTT GTG AGA GGA GGA GGC-3' (SEQ ID 
NO:22); Synechococcus pebB, EcoRIfwd: 5'-GGA ATT CAT CAC AAA TCA AAG ATT 
CAA AAG C-3' (SEQ ID NO:23), NotIrev: 5'-ATA GTT AGC GGC CGC TTA TAG ATC 
AAA AAG CAC AGT GTG G-3' (SEQ ID NO:24); and Anabaena pcyA, EcoRIfwd: 5'- 
GGA ATT CAT CTC ACT TAC TTC CAT TCC CTC-3' (SEQ ID NO:25), NotIrev: 5'- 
ATA GTT AGC GGC CGC TTA TTC TGG -GA GAT CAA ATA AC-3' (SEQ ID NO:26). 
The polymerase chain reaction products were then cut with the indicated enzymes and 
inserted into similarly restricted pGEX-6P-1. The integrity of the plasmid constructs was 
verified by complete DNA sequence determination of the insert (Davis Sequencing, Davis, 
CA). All of the constructs place the HY2-related gene downstream of and in frame with the 
glutathione S-transferase (GST) gene of Schistosoma japonicum under the control of a Ptac 
promoter. A recognition sequence for PreScission protease is located upstream of the 
cloned gene. Proteolytic cleavage yields the native protein with a small N-terminal 
extension. In all cases, the original initiation methionine was changed to an isoleucine. 
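
As a simple consistency check on the primer list, the following Python sketch locates the named restriction site in each primer. It is an illustration only: the recognition sequences are the standard ones for BamHI, SalI, EcoRI, and NotI, the dictionary and function names are hypothetical, and the Anabaena NotI reverse primer is omitted because one base is illegible in the listing.

```python
# Standard recognition sequences of the four enzymes named in the text.
RECOGNITION_SITES = {
    "BamHI": "GGATCC",
    "SalI": "GTCGAC",
    "EcoRI": "GAATTC",
    "NotI": "GCGGCCGC",
}

# (gene, enzyme) -> primer as printed, written 5' to 3'.
PRIMERS = {
    ("Synechocystis pcyA", "BamHI"): "AAG GAT CCA TGG CCG TCA CTG ATT TAA G",
    ("Synechocystis pcyA", "SalI"): "ACG CGT CGA CTA TTA TTG GAT AAC ATC AAA TAA GAC",
    ("Synechococcus pebA", "EcoRI"): "GGA ATT CAT CTT TGA TTC ATT TCT CAA TG",
    ("Synechococcus pebA", "NotI"): "ATA GTT AGC GGC CGC TCA TTT GTG AGA GGA GGA GGC",
    ("Synechococcus pebB", "EcoRI"): "GGA ATT CAT CAC AAA TCA AAG ATT CAA AAG C",
    ("Synechococcus pebB", "NotI"): "ATA GTT AGC GGC CGC TTA TAG ATC AAA AAG CAC AGT GTG G",
    ("Anabaena pcyA", "EcoRI"): "GGA ATT CAT CTC ACT TAC TTC CAT TCC CTC",
}


def normalize(primer):
    """Remove the triplet spacing and upper-case the sequence."""
    return "".join(primer.split()).upper()


def site_offset(primer, enzyme):
    """Return the 0-based offset of the enzyme's site in the primer, or -1."""
    return normalize(primer).find(RECOGNITION_SITES[enzyme])


if __name__ == "__main__":
    for (gene, enzyme), primer in PRIMERS.items():
        print(gene, enzyme, site_offset(primer, enzyme))
```

An offset of -1 would flag a primer that does not contain the site it is named for; every primer in this list contains its site near the 5' end.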

Expression and Purification 

Expression and purification were performed according to instructions 
supplied by the manufacturer (Amersham Pharmacia Biotech) and as described in Example 
1. Between 1 and 10 mg of purified protein was obtained per liter of bacterial culture. 

Protein Determination 

Protein concentration was determined by the Bradford method with BSA as a 
standard (Bradford 1976) or by measuring the absorbance at 280 nm and using the 
calculated ε280 nm for each individual protein (Gill and von Hippel (1989) Anal. Biochem. 
182: 319-326). 
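
The absorbance-based determination can be sketched as a Beer-Lambert calculation. The per-residue coefficients below (5690 for tryptophan, 1280 for tyrosine, and 120 for cystine, in M^-1 cm^-1) are the values commonly quoted from Gill and von Hippel for denatured protein and should be checked against that reference before use. The function names, the example sequence, and the example molar mass are hypothetical.

```python
# Commonly quoted per-residue molar extinction coefficients at 280 nm
# (M^-1 cm^-1); treat these as assumptions to verify against the reference.
EPSILON_TRP = 5690
EPSILON_TYR = 1280
EPSILON_CYSTINE = 120


def molar_extinction_280(sequence, n_cystine=0):
    """Estimate the molar extinction coefficient from Trp, Tyr, and cystine counts."""
    seq = sequence.upper()
    return (seq.count("W") * EPSILON_TRP
            + seq.count("Y") * EPSILON_TYR
            + n_cystine * EPSILON_CYSTINE)


def concentration_mg_per_ml(a280, epsilon, molar_mass, path_cm=1.0):
    """Beer-Lambert law: c = A / (epsilon * l), converted from mol/L to mg/mL."""
    molar = a280 / (epsilon * path_cm)
    return molar * molar_mass  # mol/L times g/mol gives g/L, which equals mg/mL


if __name__ == "__main__":
    epsilon = molar_extinction_280("MWKYLWT")  # hypothetical sequence
    print(epsilon)
    print(concentration_mg_per_ml(0.633, epsilon, molar_mass=30000.0))
```

The cystine term defaults to zero because the number of disulfide bonds cannot be read from the sequence alone.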

Standard Bilin Reductase Activity Assay 

Assays for bilin reductase activity were performed as described for PΦB 
synthase (see Example 1). 

Direct HPLC Analysis 

Bilin reductase assay mixtures were loaded onto a Waters (Milford, MA) 
C18 Sep-Pak Light preconditioned as follows: 3-mL wash with acetonitrile to wet the 
Sep-Pak, 3-mL wash with MilliQ water, and 3-mL wash with 50 mM 
4-methylmorpholine/glacial acetic acid, pH 7.7. After the sample was loaded onto the 
Sep-Pak, it was washed with 3 mL of 4-methylmorpholine/glacial acetic acid, pH 7.7, followed 
by 3 mL of 0.1% (v/v) trifluoroacetic acid. The bilin metabolites were then eluted from the 
Sep-Pak with 2 mL of 100% acetonitrile. The eluate was dried using a Speed-Vac 
lyophilizer (Savant Instruments Inc., Farmingdale, NY), and the dried samples were 
analyzed by HPLC. Samples were first dissolved in 10 µL of DMSO and then diluted with 
200 µL of the HPLC mobile phase (50:50 v/v acetone:20 mM formic acid). After the 
samples were dissolved, they were centrifuged briefly to collect the sample, passed through 
a 0.45-µm polytetrafluoroethylene syringe filter, and chromatographed using a Varian 
(Palo Alto, CA) 5000 liquid chromatograph. The HPLC column used for all of the analyses 
was a Phenomenex (Torrance, CA) Ultracarb 5-µm ODS20 4.6 x 250-mm analytical 
column with a 4.6 x 30-mm guard column of the same material. The mobile phase used 
with this column was acetone:20 mM formic acid (50:50, v/v). The flow rate was 0.8 
mL/min. The eluate was monitored at 560 nm for the first 11.5 min and at 380 nm for the 
remaining time using a Varian UV100 flow-through absorbance detector. Peak areas were 
quantitated using a Hewlett-Packard (Palo Alto, CA) model 3365 Chemstation II. 

Coupled Spectrophotometric and Spectrofluorometric Analysis 

An aliquot of 20 µg of crude recombinant Cph1 apoprotein (Yeh and 
Lagarias (1998) Proc. Natl. Acad. Sci. USA 95: 13976-13981) was added to 1 mL of bilin 
reductase assay mixture under green safelight. Mixtures were incubated for 30 min at room 
temperature to permit phytobilin binding. Phytochrome difference spectra were obtained as 
described previously (Terry and Lagarias (1991) J. Biol. Chem. 266: 22215-22221). A 
spectrofluorometric assay was used to detect the formation of intensely fluorescent 
phycoerythrobilin (PEB) adducts of Cph1 (Murphy and Lagarias (1997) Curr. Biol. 7: 870- 
876). Emission spectra were obtained with an excitation wavelength of 545 nm using an 
SLM Aminco Bowman AB2 spectrofluorometer (Spectronic Instruments Inc., Rochester, 
NY). 

Example 3 

Production of Functional Phytochrome in Living Cells 

In higher plants, two enzymes are committed to the biosynthesis of 
phytochromobilin (PΦB), the chromophore precursor of phytochrome. These enzymes are 
heme oxygenase (encoded by HY1 in Arabidopsis thaliana (Muramoto et al. (1999) Plant 
Cell 11: 335-347)), which catalyzes the ferredoxin-dependent conversion of heme to 
biliverdin IXα (BV), and phytochromobilin:ferredoxin oxidoreductase (PΦB synthase; 
encoded by HY2 in Arabidopsis), which catalyzes the ferredoxin-dependent 
conversion of BV to PΦB. A homolog of the HY1 protein, HO1, which is encoded by 
Cyanobase Locus SLL1184 of the cyanobacterium Synechocystis sp. PCC 6803, has been 
shown to be a functional ferredoxin-dependent heme oxygenase. Here we show that 
coexpression of the biosynthetic enzymes HO1 and HY2 together with the cyanobacterial 
phytochrome Cph1 yields the production of photoreversible holophytochrome in the 
bacterium Escherichia coli with spectroscopic properties consistent with the formation of a 
phytochromobilin adduct. 

This work involved the production of a synthetic operon composed of HO1 
from Synechocystis sp. PCC6803 and the mature HY2 coding region (mHY2) from 
Arabidopsis thaliana that lacks the plastid targeting sequence. The cloning of HO1 and 
mHY2 open reading frames into the plasmid pPROLarA122 (Clontech Laboratories) places 
this operon under regulatory control of a dual Ara/Lac promoter. Upon introduction of this 
plasmid into E. coli cells harboring the Cph1-expression plasmid, pBAD/Cph1(514), in 
which Cph1(N514) is under regulatory control of an Ara promoter, the production of 
photoactive holophytochrome in vivo was determined. 

Methods. 

Plasmid Construction. 

The synthetic operon consisting of HO1 and pcyA coding regions was cloned 
in the expression vector pPROLarA122 to produce the plasmid 
pPROLarA122/HO1-RBS-SLR0116 (SEQ ID NO:27) as follows. The HO1 gene from Synechocystis sp. PCC6803 
was first PCR amplified with the sense primer Phol-SIK, 5'-ATC GGT ACC ATG AGT 
GTC AAC TTA GCT TG-3' (SEQ ID NO:28) (containing a KpnI restriction site) and 
antisense primer Pbol-ArB, 5'-ATT GGA TCC TTT CTC CTC TTT AAC TAG CCT TCG 
GAG GTG GCG A-3' (SEQ ID NO:29) (containing a synthetic ribosome binding site 
upstream of a BamHI restriction site) using chromosomal DNA from Synechocystis sp. 
PCC6803 as a template. The reaction was carried out using a standard reaction mix, Taq 
polymerase, and a 30 cycle run with an annealing temperature of 50°C. The gene was then 
cloned into TA cloning plasmid, pCR2.1 (Invitrogen), producing plasmid 
pCR2.1/HO1-RBS (not shown). 

The synthetic operon consisting of HO1 from Synechocystis sp PCC6803 
(Cornejo et al. (1998) Plant J. 15: 99-107) and mHY2 coding regions was produced by 
cloning mHY2 into the plasmid pCR2.1/HO1-RBS to produce plasmid 
pCR2.1/HO1-RBS-HY2. Specifically, the mHY2 cDNA from Arabidopsis was PCR-amplified using plasmid 
DNA from the clone pGEX-mHY2 (Example 2), which contains the full length mHY2 
cDNA minus the transit peptide in a GST-fusion vector, with sense primer mHY2-EcoRV: 
5'-CGG ATA TCA TGT CCC CTA TAC TA-3' (SEQ ID NO:30) and the antisense primer, 
mHY2-NotI: 5'-GCG CGG CCG CTT AGC CGA TAA ATT GTC C-3' (SEQ ID NO:31) 
under standard conditions. The reaction was carried out using a standard reaction mix, Pfu 
polymerase, and a 35 cycle run with an annealing temperature of 55°C. The PCR product 
was restricted with EcoRV and NotI and then subcloned into the plasmid 
pCR2.1/HO1-RBS to produce the plasmid pCR2.1/HO1-RBS-HY2. Finally, pCR2.1/HO1-RBS-HY2 
was restricted with KpnI/NotI and the resulting fragment was ligated with KpnI/NotI 
restricted pPROLarA122 (Clontech Laboratories) resulting in production of 
pPROLarA122/HO1-RBS-HY2. 

E. coli strains, media, and transformation. 

The plasmid pPROLarA122/HO1-RBS-HY2 was transformed into E. coli 
strain LMG194 (Invitrogen) competent cells containing the apophytochrome expression 
plasmid, pBAD/Cph1(514). Dual ampicillin and kanamycin selection using minimal RM 
media was performed to isolate transformants. 

Protein expression. 

The E. coli strain LMG194 containing both plasmids pBAD/Cph1(514) and
pPROLarA122/HO1-RBS-HY2 was grown overnight at 37°C in 3 ml RM media containing
25 µg/ml kanamycin and 50 µg/ml ampicillin. A 1 ml aliquot of this culture was transferred
to 100 ml of RM media and grown at 37°C to an OD600 of approximately 0.5. A 50 ml portion
of this culture was then transferred to 450 ml LB media containing 25 µg/ml kanamycin and 50
µg/ml ampicillin. IPTG was added to a final concentration of 1 mM to induce expression of
the synthetic operon. After incubation for 1 h at 30°C, arabinose was added to a final
concentration of 0.002% to induce expression of apoCph1. The culture was grown at 30°C
for 5 h, after which time cells were collected by centrifugation and resuspended in 10 ml
lysis buffer (50 mM Tris-HCl, pH 8.0, 100 mM NaCl, 0.05% v/v NP40, 2 µg/ml leupeptin,
2 mM benzamidine, 2 mM PMSF, 1 mM DTT, 3 µg/ml pepstatin A). A cell lysate was
obtained by lysing the cells at 10,000 psi with a French Press. After insoluble material was
removed by centrifugation, the crude homogenate was placed on ice at 4°C and examined
for holophytochrome spectrophotometrically.
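
The inducer additions above follow the usual dilution relation C1 x V1 = C2 x V2. The sketch below, which is illustrative only, works out the stock volumes for the 500 ml induction culture (50 ml inoculum plus 450 ml LB). The stock concentrations (1 M IPTG and 20% arabinose) are hypothetical values chosen for the example and are not stated in the text. The small volume contributed by the stock itself is neglected.

```python
def stock_volume_ml(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Volume of stock to add so the culture reaches final_conc (C1*V1 = C2*V2).

    final_conc and stock_conc must share the same unit (e.g. mM or %).
    The volume added by the stock itself is neglected.
    """
    if stock_conc <= final_conc:
        raise ValueError("stock must be more concentrated than the final solution")
    return final_conc * final_volume_ml / stock_conc


# Volumes taken from the text: 50 ml of starter culture into 450 ml LB.
INOCULUM_ML = 50.0
FRESH_MEDIUM_ML = 450.0
CULTURE_ML = INOCULUM_ML + FRESH_MEDIUM_ML

# Hypothetical stock concentrations (assumptions, not from the text).
IPTG_STOCK_MM = 1000.0        # 1 M IPTG
ARABINOSE_STOCK_PCT = 20.0    # 20% arabinose

# Final concentrations taken from the text.
iptg_ml = stock_volume_ml(1.0, IPTG_STOCK_MM, CULTURE_ML)
arabinose_ml = stock_volume_ml(0.002, ARABINOSE_STOCK_PCT, CULTURE_ML)

# Fraction of the induction culture that comes from the starter culture.
inoculum_fraction = INOCULUM_ML / CULTURE_ML

if __name__ == "__main__":
    print(f"IPTG stock to add: {iptg_ml:.3f} ml")
    print(f"Arabinose stock to add: {arabinose_ml:.3f} ml")
    print(f"Inoculum fraction: {inoculum_fraction:.2f}")
```

With different stock concentrations only the two stock constants need to change; the final concentrations and culture volume are those given in the protocol.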

Protein purification.

The crude soluble fraction was run over a Talon (Clontech) metal affinity
chromatography column (5 ml bed volume), washed with 20 ml extraction/wash (EW) buffer, and
eluted with 2 bed volumes of 1x elution buffer (EW buffer containing 200 mM imidazole).
The resulting solution was dialyzed overnight against 2 liters of 10 mM HEPES, pH 7.5,
and then concentrated using an Amicon ultrafiltration cell.

Results & Discussion.

After 5 h induction at 30°C, cultures containing both the pBAD/Cph1(514) and
pPROLarA122/HO1-RBS-HY2 plasmids turned blue-green. As shown in the difference
spectrum (Figure 15), co-expression of pBAD/Cph1(514) and pPROLarA122/HO1-RBS-HY2
yielded crude cell extracts containing photoactive holophytochrome. The spectrum of
purified holoCph1(N514) from these cells reveals absorption maxima for the Pr form at 660
nm and for the Pfr form at 710 nm, consistent with the formation of a phytochromobilin
(PΦB) adduct in vivo, as opposed to the blue-shifted phycocyanobilin (PCB) adduct formed
in Cph1(N514)-expressing E. coli cells coexpressing the PCB operon, i.e., HO1 and PcyA
(Figure 16) (Yeh et al. (1997) Science, 277: 1505-1508).
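
A phytochrome difference spectrum of the kind referred to above is commonly computed as the absorbance of the Pr-enriched sample minus that of the Pfr-enriched sample at each wavelength. The sketch below illustrates only that arithmetic. The absorbance values are placeholders invented so that the extrema fall at the 660 nm and 710 nm maxima quoted in the text; they are not measurements from Figure 15, and the function names are hypothetical.

```python
def difference_spectrum(abs_pr, abs_pfr):
    """Pointwise Pr-minus-Pfr absorbance on a shared wavelength grid."""
    if len(abs_pr) != len(abs_pfr):
        raise ValueError("spectra must be sampled on the same wavelength grid")
    return [a - b for a, b in zip(abs_pr, abs_pfr)]


def extrema(wavelengths_nm, diff):
    """Return (wavelength of the maximum, wavelength of the minimum)."""
    i_max = max(range(len(diff)), key=diff.__getitem__)
    i_min = min(range(len(diff)), key=diff.__getitem__)
    return wavelengths_nm[i_max], wavelengths_nm[i_min]


# Placeholder data for illustration only (NOT measured values).
wavelengths_nm = [620, 640, 660, 680, 700, 710, 720, 740]
abs_pr_enriched = [0.30, 0.55, 0.80, 0.50, 0.25, 0.15, 0.10, 0.05]
abs_pfr_enriched = [0.25, 0.40, 0.50, 0.45, 0.50, 0.55, 0.45, 0.20]

diff = difference_spectrum(abs_pr_enriched, abs_pfr_enriched)
peak_nm, trough_nm = extrema(wavelengths_nm, diff)

if __name__ == "__main__":
    print("difference maximum at", peak_nm, "nm")
    print("difference minimum at", trough_nm, "nm")
```

A positive lobe near the Pr maximum together with a negative lobe near the Pfr maximum is the signature of a photoactive holophytochrome in such a plot.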

It is understood that the examples and embodiments described herein are for
illustrative purposes only and that various modifications or changes in light thereof will be
suggested to persons skilled in the art and are to be included within the spirit and purview of
this application and scope of the appended claims. All publications, patents, and patent
applications cited herein are hereby incorporated by reference in their entirety for all
purposes.

CLAIMS



What is claimed is: 



1. An isolated HY2 family bilin reductase comprising an amino acid
consensus sequence as illustrated in Figure 5 or in Figure 10 and having bilin reductase
activity.

2. The bilin reductase of claim 1, wherein said bilin reductase is not
hvrccr or atrccrl.

3. The bilin reductase of claim 1, wherein said bilin reductase is not
rccr_horvu or rccr_arath.

4. The bilin reductase of claim 1, wherein said bilin reductase is not
ycp2_synpy or ycp3_synpy.

5. The bilin reductase of claim 1, wherein said bilin reductase comprises
at least 50% sequence conservation as shown in Figure 10.

6. The bilin reductase of claim 1, wherein said bilin reductase comprises
at least 70% sequence conservation as shown in Figure 10.

7. The bilin reductase of claim 1, wherein said bilin reductase comprises
at least 90% sequence conservation as shown in Figure 10.

8. The bilin reductase of claim 1, wherein said bilin reductase comprises
at least 80% sequence conservation as shown in Figure 5.

9. The bilin reductase of claim 1, wherein said bilin reductase comprises
at least 100% sequence conservation as shown in Figure 5.

10. The bilin reductase of claim 1, wherein said bilin reductase is PebA.

11. The bilin reductase of claim 1, wherein said bilin reductase is PebB.

12. A ferredoxin-dependent bilin reductase comprising at least 15%
sequence identity with an enzyme selected from the group consisting of HY2_ARATH,
YCP2_SYNPY, YHP2_PROMA, YHP3_PROMA, YCP3_SYNPY, SLR0116,
PcyA_ANASP, PcyA_NOSPU, PcyA_SYNY3, PcyA_SYN81, PcyA_PROME,
PebA_SYNPY, PebA_SYN81, PebA_PROMA, PebA_PROME, PebB_NOSPU,
HY2_ARATH, RCCR_ARATH, and RCCR_HORVU, and where, when aligned with HY2,
comprises conserved hydrophobic residues at positions 137, 157, 158, 256, and 314.

13. The bilin reductase of claim 12, wherein said bilin reductase, when
aligned with HY2, comprises a residue selected from the group consisting of Pro-151,
Phe-221, Ser-222, and Asp-171.

14. The bilin reductase of claim 13, wherein said bilin reductase, when
aligned with HY2, comprises Pro-151, Phe-221, Ser-222, and Asp-171.

15. The bilin reductase of claim 12, wherein said bilin reductase is not
hvrccr or atrccrl.

16. The bilin reductase of claim 12, wherein said bilin reductase is not
rccr_horvu or rccr_arath.

17. The bilin reductase of claim 12, wherein said bilin reductase is not
ycp2_synpy or ycp3_synpy.

18. The bilin reductase of claim 12, wherein said bilin reductase is not
HY2.

19. An isolated bilin reductase having bilin reductase activity and
comprising an amino acid sequence of a polypeptide selected from the group consisting of
HY2, athy2, slr0116, c362_anab, ycp2_synpy, ycp3_synpy, PcyA_ANASP, PcyA_NOSPU,
PcyA_SYNY3, PcyA_SYN81, PcyA_PROME, PebA_SYNPY, PebA_SYN81,
PebA_PROMA, PebA_PROME, PebA_NOSPU, PebB_SYNPY, PebB_SYN81,
PebB_PROMA, PebB_PROME, PebB_NOSPU, HY2_ARATH, RCCR_ARATH, and
RCCR_HORVU, or conservative substitutions thereof.

20. The bilin reductase of claim 19, wherein said bilin reductase
comprises an amino acid sequence of a polypeptide selected from the group consisting of
athy2, slr0116, c362_anab, ycp2_synpy, ycp3_synpy, PcyA_ANASP, PcyA_NOSPU,
PcyA_SYNY3, PcyA_SYN81, PcyA_PROME, PebA_SYNPY, PebA_SYN81,
PebA_PROMA, PebA_PROME, PebA_NOSPU, PebB_SYNPY, PebB_SYN81,
PebB_PROMA, PebB_PROME, PebB_NOSPU, HY2_ARATH, RCCR_ARATH, and
RCCR_HORVU.

21. A method of converting a biliverdin to a phytobilin, said method
comprising contacting a bilin reductase of claim 1 with a biliverdin whereby said biliverdin
is converted to a phytobilin.

22. The method of claim 19, wherein said bilin reductase is a
cyanobacterial bilin reductase.

23. The method of claim 19, wherein said bilin reductase is an algal bilin
reductase.

24. The method of claim 19, wherein said bilin reductase is a plant bilin
reductase.

25. The method of claim 21, wherein said bilin reductase is
recombinantly expressed.

26. The method of claim 21, wherein said contacting is ex vivo.

27. The method of claim 21, wherein said contacting is in a cell and said
bilin reductase is a heterologous polypeptide.

28. The method of claim 21, further comprising contacting said
phytochromobilin with a second bilin reductase to produce a phytochrome.

29. The method of claim 21, further comprising contacting said
phytochromobilin with a second bilin reductase to produce a phytofluor.

30. The method of claim 29, wherein said second bilin reductase is PebB.

31. The method of claim 21, wherein said bilin reductase is ycp2_synpy.

32. The method of claim 29, wherein said bilin reductase is ycp3_synpy.

33. A nucleic acid comprising a nucleic acid encoding a bilin reductase
of any one of claims 1 through 20.

34. The nucleic acid of claim 33, wherein said nucleic acid is a vector.

35. A cell comprising a heterologous nucleic acid comprising a nucleic
acid encoding a bilin reductase of any one of claims 1 through 20.

36. The cell of claim 35, wherein said cell is selected from the group
consisting of an algal cell, a plant cell, a yeast cell, a bacterial cell, an insect cell, and a
mammalian cell.

37. A nucleic acid comprising a nucleic acid that specifically hybridizes
with a nucleic acid of any one of claims 1 through 20 under stringent conditions and that
encodes a polypeptide having bilin reductase activity, wherein said nucleic acid does not
encode an hvrccr or an atrccr polypeptide.

38. The nucleic acid of claim 37, wherein said nucleic acid is a vector.

39. A method of detecting expression of a polypeptide, said method
comprising:
providing a cell comprising a nucleic acid encoding an
apophytochrome; and a nucleic acid encoding a bilin reductase that produces a phytobilin
that assembles with said apophytochrome to produce a phytofluor;
and detecting an optical signal produced by said phytofluor.

40. A method of producing a photoactive holophytochrome, said method
comprising:
co-expressing in a cell:
a heme oxygenase;
an apophytochrome;
and a ferredoxin-dependent bilin reductase;
whereby said cell produces said photoactive holophytochrome and where one or more of
said apophytochrome and said ferredoxin-dependent bilin reductase are expressed by
heterologous nucleic acids.

WO 01/94548    PCT/US01/18326



41. The method of claim 40, wherein said cell is selected from the group
consisting of an algal cell, a yeast cell, a bacterial cell, a plant cell, an insect cell, and a
mammalian cell.

42. The method of claim 40, wherein said ferredoxin-dependent bilin
reductase is an HY2 family bilin reductase.

43. The method of claim 40, wherein said apophytochrome and said
ferredoxin-dependent bilin reductase are both expressed by heterologous nucleic acids.

44. The method of claim 40, wherein said heme oxygenase is expressed
by a heterologous nucleic acid.

45. The method of claim 40, wherein said photoactive holophytochrome
is not a phytofluor.

46. The method of claim 45, wherein said ferredoxin-dependent bilin
reductase is an HY2 family member.

47. The method of claim 45, wherein said ferredoxin-dependent bilin
reductase is HY2.

48. The method of claim 45, wherein said ferredoxin-dependent bilin
reductase is pcyA.

49. The method of claim 40, wherein said photoactive holophytochrome
is a phytofluor.

50. The method of claim 49, wherein said apophytochrome is expressed
as a fusion protein with a protein that is to be labeled with said phytofluor.

51. The method of claim 49, wherein said method comprises expressing
the ferredoxin-dependent bilin reductase pebA or pebB.

52. The method of claim 51, wherein said method comprises expressing
both ferredoxin-dependent bilin reductase pebA and pebB.

53. The method of claim 51, wherein said cell is a bacterial cell.

54. The method of claim 40, wherein said method further comprises
recovering said photoactive holophytochrome from said cell.

55. A cell comprising:
a heme oxygenase;
an apophytochrome;
and a ferredoxin-dependent bilin reductase;
whereby said cell produces a photoactive holophytochrome and where one or more of said
apophytochrome and said ferredoxin-dependent bilin reductase are expressed by
heterologous nucleic acids.

56. The cell of claim 55, wherein said cell is selected from the group
consisting of an algal cell, a yeast cell, a bacterial cell, a plant cell, an insect cell, and a
mammalian cell.

57. The cell of claim 55, wherein said ferredoxin-dependent bilin
reductase is an HY2 family bilin reductase.

58. The cell of claim 55, wherein said apophytochrome and said
ferredoxin-dependent bilin reductase are both expressed by heterologous nucleic acids.

59. The cell of claim 55, wherein said heme oxygenase is an endogenous
heme oxygenase.

60. The cell of claim 55, wherein said heme oxygenase is expressed by a
heterologous nucleic acid.

61. The cell of claim 55, wherein said photoactive holophytochrome is
not a phytofluor.

62. The cell of claim 61, wherein said ferredoxin-dependent bilin
reductase is an HY2 family member.

63. The cell of claim 61, wherein said ferredoxin-dependent bilin
reductase is HY2.

64. The cell of claim 61, wherein said ferredoxin-dependent bilin
reductase is pcyA.



65. The cell of claim 55, wherein said photoactive holophytochrome is a
phytofluor.

66. The cell of claim 65, wherein said apophytochrome is expressed as a
fusion protein with a protein that is to be labeled with said phytofluor.

67. The cell of claim 65, wherein said cell comprises expressing the
ferredoxin-dependent bilin reductase pebA or pebB.

68. The cell of claim 67, wherein said cell comprises expressing both
ferredoxin-dependent bilin reductase pebA and pebB.

69. The cell of claim 67, wherein said cell is a bacterial cell.

70. A recombinant nucleic acid comprising:
a nucleic acid encoding a heme oxidoreductase; and
a nucleic acid encoding a ferredoxin-dependent bilin reductase;
where said nucleic acid expresses a functional heme oxidoreductase and a functional bilin
reductase.

71. The nucleic acid of claim 70, wherein said heme oxidoreductase and
said bilin reductase are under control of the same promoter.

72. The nucleic acid of claim 71, wherein said promoter is a constitutive
promoter.

73. The nucleic acid of claim 71, wherein said promoter is an inducible
promoter.

74. The nucleic acid of claim 71, wherein said promoter is a
tissue-specific promoter.

75. The nucleic acid of claim 70, wherein said nucleic acid is present in a
cell.

76. The nucleic acid of claim 75, wherein said cell is selected from the
group consisting of an algal cell, a bacterial cell, a plant cell, a yeast cell, a mammalian cell,
and an insect cell.

77. The nucleic acid of claim 75, wherein said nucleic acid comprises a
gene selected from the group consisting of HO1, HY2, PcyA, PebA, and PebB.

78. The nucleic acid of claim 77, wherein said nucleic acid comprises an
HO1 coding region and a pcyA coding region.

79. The nucleic acid of claim 78, wherein said nucleic acid further
comprises a pcyB.

[Figure: annotated HY2 nucleotide sequence with deduced amino acid translation and marked hy2 mutant alleles (including hy2-101 and hy2-107); the sequence text is not recoverable from the scan.]

SEQUENCE LISTING



SEQ ID NO:27. 

DNA sequence of expression plasmid pPROLarA122/HO1-RBS-SLR0116



CAGAATTCATTAA^GAGGAGAAAGGTACCATGAGTGTCAACTTAGCTTCCCAGTTGCGGGAAGGGACGAAAAAA 
TCCCACTCCATGGCGGAGAACGTCGGCTTTGTCAAATGCTTCCTCAAGGGCGTTGTCGAGAAAAATTCCTACCG 
TAAGCTGGTTGGCAATCTCTACTTTGTCTACAGTGCCATGGAAGAGGAAATGGCAAAATTTAAGGACCATCCCA 
TCCTCAGCCACATTTACTTCCCCGAACTCAACCGCAAACAAAGCCTAGAGCAAGACCTGCAATTCTATTACGGC

TCCAACTGGCGGCAAGAAGTGAAAATTTCTGCCGCTGGCCAAGCCTATGTGGACCGAGTCCGGCAAGTGGCCGC
TACGGCCCCTGAATTGTTGGTGGCCCATTCCTACACCCGTTACCTGGGGGATCTTTCCGGCGGTCAAATTCTCA 
AGAAAATTGCCCAAAATGCCATGAATCTCCACGATGGTGGCACAGCTTTGTATGAATTTGCCGACATTGATGAC
GAAAAGGCTTTTAAAAATACCTACCGTCAAGCTATGAATGATCTGCCCATTGACCAAGCCACCGCCGAACGGAT 
TGTGGATGAAGCCAATGACGCCTTTGCCATGAACATGAAAATGTTCAACGAACTTGAAGGCAACGTGATCAAGG 

CGATCGGCATTATGGTGTTCAACAGCCTCACCCGTCGCCGCAGTCAAGGCAGCACCGAAGTTGGCCTCGCCACC
TCCGAAGGCTAGTTAAAGAGGAGAAAGGATCCATGGCCGTCACTGATTTAAGTTTGACCAATTCTTCCCTGATG 
CCTACGTTGAACCCGATGATTCAACAGTTGGCCCTGGCGATCGCCGCTAGTTGGCAAAGTTTACCCCTCAAGCC 
CTATCAATTGCCGGAGGATTTGGGCTACGTAGAAGGCCGCCTGGAAGGGGAAAAGTTAGTGATTGAAAATCGGT 
GCTACCAAACGCCCCAGTTTCGCAAAATGCATTTGGAGTTGGCCAAGGTGGGCAAAGGGTTGGATATTCTCCAC 

TGTGTAATGTTTCCTGAGCCTTTATACGGTCTACCTTTGTTTGGCTGTGACATTGTGGCCGGCCCCGGTGGAGT
AAGTGCGGCTATTGGGGATCTATCCCCCACCCAAAGCGATCGCCAATTGCCCGCAGCGTACCAAAAATCATTGG 
CAGAGCTAGGCCAGCCAGAATTTGAGCAACAACGGGAATTGCCCCCCTGGGGAGAAATATTTTCTGAATATTGT 
TTATTCATGGGTCCCAGCAATGTCACTGAAGAAGAAAGATTTGTACAAAGGGTAGTGGACTTTTTGCAAATTCA 
TTGTCACCAATGCATCGTTGCCGAACCCTTGTCTGAAGCTCAAACTTTGGAGCACCGTCAGGGGCAAATTCATT 

ACTGCCAACAACAACAGAAAAATGATAAAACCCGTCGGGTACTGGAAAAAGCTTTTGGGGAAGCTTGGGCGGAA
CGGTATATGAGCCAAGTCTTATTTGATGTTATCCAATAATCTAGAGGCATCAAATAAAACGAAAGGCTCAGTCG 
AAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGCCGGCCTA 
GACCTAGGGGATATATTCCGCTTCCTCGCTCACTGACTCGCTAGGCTCGGTCGTTCGACTGCGGCGAGCGGAAA 
TGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGG 

CAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGA
AACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCGTGGCGGCTCGCTCGTGCGCTCTCCTGTTCCTGCCTT 
TCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACAGTCAGTTCCGGGTAG 
GCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTA 
TCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCGACTGGTAATTGATTTAGAGGAG 

TTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCA
GTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGGAAGGCGGTTTTTTCGTTTT 
CAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAATCAGATAAAATATTACTAG 
ATTTCAGTGCAATTTATCTCTTCAAATGTAGCACCTGAAGTCAGCGCCATACGATATAAGTTGTTACTAGTGCT 
TGGATTCTCACCAATAAAAAACGCCCGGCGGCAACCGAGCGTTCTGAACAAATCCAGATGGAGTTCTGAGGTdA
TTACTGGATCTATCAACAGGAGTCCAAGCGAGCTCTCGAACCCCAGAGTCCCGCTCAGAAGAACTCGTCAAGAA 

GGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAAAGCACGAGGAAGCGGTCAGCCCATTCG 
CCGCCAAGCTCTTCAGCAATATCACGGGTAGCCAACGCTATGTCCTGATAGCGGTCCGCCACACCCAGCCGGCC 
ACAGTCGATGAATCCAGAAAAGCGGCCATTTTCCACCATGATATTGGGCAAGCAGGCATCGCCATGGGTCACGA 
CGAGATCGTCGCCGTCGGGCATGCGCGCCTTGAGCCTGGCGAACAGTTCGGCTGGCGCGAGCCCCTGATGCTCT 
TCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGTGCTCGCTCGATGCGATGTTTCGCTTG

GTGGTCGAATGGGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGATGGATACTTTCT 
CGGCAGGAGCAAGGTGAGATGACAGGAGATCCTGCCCCGGCACTTCGCCCAATAGCAGCGAGTCCCTTCCCGCT 
TCAGTGACAACGTCGAGCACAGCTGCGCAAGGAACGCCCGTCGTGGGCAGGCACGATAGCCGCGCTGCCTCGTC 
CTGCAGTTCATTCAGGGCACCGGACAGGTCGGTGTTGACAAAAAGAACCGGGCGCCCCTGCGCTGACAGCCGGA 
AGACGGCGGCATCAGAGCAGCCGATTGTGTGTTGTGCCCAGTCATAGCGGAATAGCCTCTCCACCCAAGCGGCC 
GGAGAACCTGCGTGCAATCCATCTTGTTCAATCATGCGAAACGATCCTCATCCTGTCTCTTGATCAGATCTTGA 
TCCCCTGCGCCATCAGATCCTTGGCGGCAAGAAAGCCATCCAGTTTACTTTGCAGGGCTTCCCAACCTTACCAG 
AGGGCGCCCCAGCTGGCAATTCCGACGTCTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGGCCCTCGG
ACACCGAGGAGAATGTCAAGAGGCGAACACACAACGTCTTGGAGCGCCAGAGGAGGAACGAGCTAAAACGGAGC 
TTTTTTGCCCTGCGTGACCAGATCCCGGAGTTGGAAAACAATGAAAAGGCCCCCAAGGTAGTTATCCTTAAAAA 
AGCCACAGCATACATCCTGTCCGTCCAAGCAGAGGAGCAAAAGCTCATTTCTGAAGAGGACTTGTTGCGGAAAC 
GACGAGAACAGTTGAAACACAAACTTGAACAGCTACGGAACTCTTGTGCGTAAGGAAAAGTAAGGAAAACGATT 
CCTTCTAACAGAAATGTCCTGAGCAATCACCTATGAACTGTCGACTCGAGCATAGCATTTTTATCGATAAGATT 

AGCGGATCTAACCTTTACAATTGTGAGCGCTCACAATTATGATAGATTCAATTGTGAGCGGATAACAATTTCAC 
A 